Abstract: In this paper, we propose a joint transceiver design 
for single-carrier frequency-domain equalization (SC-FDE) based 
multiple-input multiple-output (MIMO) relay systems. To this 
end, we first derive the optimal minimum mean-squared er- 
ror linear and decision-feedback frequency-domain equalization 
filters at the destination along with the corresponding error 
covariance matrices at the output of the equalizer. Subsequently, 
we formulate the source and relay precoding matrix design 
problem as the minimization of a family of Schur-convex and 
Schur-concave functions of the mean-squared errors at the output 
of the equalizer under separate power constraints for the source 
and the relay. By exploiting properties of the error covariance 
matrix and results from majorization theory, we derive the 
optimal structures of the source and relay precoding matrices, 
which allows us to transform the matrix optimization problem 
into a scalar power optimization problem. Adopting a high signal- 
to-noise ratio approximation for the objective function, we obtain 
the global optimal solution for the power allocation variables. 
Simulation results illustrate the excellent performance of the 
proposed system and its superiority compared to conventional 
orthogonal frequency-division multiplexing based MIMO relay 
systems. 



I. Introduction 

Multiple-input multiple-output (MIMO) relay systems uti- 
lizing multiple antennas at the relay node have recently 
received significant research interest due to their potential 
to enhance network performance [1]. An important research 
problem for MIMO relay systems is the design of optimal node 
processing matrices to improve spectral efficiency and/or error 
performance through efficient utilization of transmit channel 
state information (CSIT). For example, assuming availability 
of CSIT at the source and relay nodes and linear processing 
at the destination, the source and relay processing matrices 
were optimized for maximization of the relay channel capacity 
and minimization of the mean-squared error (MSE) in [2], 
[3] and [4], [5], respectively. In [6], a general framework 
for linear transceiver optimization in MIMO relay systems 
was provided for a large family of objective functions, which 
includes the capacity maximizing and the MSE minimizing 
designs as special cases. The extension of the results in [6] 
to multi-hop MIMO relay systems with linear and decision- 
feedback equalization receivers was investigated in [7] and 
[8], respectively. More recently, the design of MIMO relay 
systems with partial or imperfect CSIT at source and relay 
was considered in [9], [10]. 

The authors are with the Department of Electrical and Computer Engineer- 
ing, University of British Columbia, Vancouver, BC Canada, V6T, 1Z4, email: 
{peiranw, rschober, vijayb}@ece.ubc.ca. This paper will be presented in 
part at the IEEE Global Communications Conference (Globecom), Anaheim, 
December 2012. 



Existing works on transceiver design for MIMO relay systems 
are based on the assumption of frequency-nonselective 
(flat) channels ID, lU, Q, lHJ, IM or frequency-selective 
channels in combination with orthogonal frequency-division 
multiplexing (OFDM) [2], [6], [9]. Since OFDM decomposes 
a frequency-selective channel into multiple parallel flat sub- 
channels, the transceiver designs developed for frequency- 
nonselective channels can be extended to OFDM based MIMO 
relay systems by solving an additional subcarrier power al- 
location problem across different subcarriers. However, its 
large peak-to-average power ratio (PAPR) makes OFDM less 
appealing for application in the uplink of wireless 
communication systems. Block-based single-carrier transmission 
with frequency-domain equalization (SC-FDE) is a promising 
alternative to OFDM due to its comparable implementation 
complexity and lower PAPR [11], [12]. To the best of the 
authors' knowledge, the optimization of SC-FDE based MIMO 
relay systems has not been considered in the literature so far. 
A key difference between SC-FDE based MIMO systems and 
MIMO-OFDM systems is that the performance of the former 
depends on the MSEs of the spatial streams, whereas the 
performance of the latter depends on the subcarrier MSEs. 
This difference makes the optimization of SC-FDE 
based MIMO systems more challenging than the optimization 
of MIMO-OFDM systems. 

In this paper, we make the common assumption of perfect 
CSI at all nodes [2]-[8] and we propose a joint transceiver 
design for MIMO relay systems employing either frequency-domain 
linear equalization (FD-LE) or frequency-domain decision 
feedback equalization (FD-DFE) at the destination. 
We optimize the source and relay precoding matrices for 
minimization of a general function of the MSEs of the spatial 
streams under separate power constraints for source and relay. 
Specifically, as objective functions we adopt the arithmetic 
MSE (AMSE), the geometric MSE (GMSE), and the maximum 
MSE (maxMSE) [6], (jlll, which are closely related to 
channel capacity and error rate performance. For the case of 
FD-LE, we show that the optimal source and relay precoding 
matrices have a structure very similar to that of the optimal 
precoding matrices in MIMO-OFDM relay systems. However, 
the remaining power allocation problem is significantly 
different from the power allocation problem for MIMO-OFDM 
relay systems, especially for the GMSE and maxMSE criteria. 
For FD-DFE, the considered objective functions cannot be 
explicitly expressed in terms of the optimization variables and 
depend on the number of feedback filter taps, which makes 
a direct solution of the optimization problem challenging. 
However, we can show that for FD-DFE, the three considered 
objective functions are equivalent. Furthermore, we develop an 
upper bound for the objective function which is independent of 
the number of feedback filter taps and is a comparatively simple 
function of the optimization variables. Interestingly, this upper 
bound is shown to be identical to the GMSE objective function 
for the FD-LE receiver. Consequently, a unified solution for 
the power allocation problem for both FD-LE and FD-DFE can 
be obtained, which greatly simplifies the design procedure. 

The remainder of this paper is organized as follows. In 
Section II, the system model is presented. In Section III, the 
optimal minimum MSE (MMSE) FDE filters and the corre- 
sponding stream error covariance (CV) matrices are derived. 
The optimal source and relay precoding matrices are presented 
in Section IV. Simulation results are given in Section V, and 
some conclusions are drawn in Section VI. 

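In this paper, $\mathrm{tr}(\mathbf{A})$, $\mathbf{A}^{-1}$, $\mathbf{A}^{T}$, and $\mathbf{A}^{\dagger}$ denote the 
trace, inverse, transpose, and conjugate transpose of matrix 
$\mathbf{A}$, respectively. $\mathbb{C}^{M \times N}$ denotes the space of all $M \times N$ 
complex matrices and $\mathbf{I}_M$ is the $M \times M$ identity matrix. 
$\mathbf{n} \sim \mathcal{CN}(\mathbf{0}, \sigma_n^2 \mathbf{I}_M)$ indicates that $\mathbf{n} \in \mathbb{C}^{M \times 1}$ is a complex 
Gaussian distributed vector with zero mean and CV matrix 
$\sigma_n^2 \mathbf{I}_M$. $E[\cdot]$ and $\otimes$ denote statistical expectation and the 
Kronecker product, respectively. $\mathrm{blkcirc}([\mathbf{A}_1^T, \ldots, \mathbf{A}_N^T]^T)$ and 
$\mathrm{blkdiag}([\mathbf{A}_1^T, \ldots, \mathbf{A}_N^T]^T)$ denote a block circular matrix 
and a block diagonal matrix, respectively, formed by the 
block-wise vector $[\mathbf{A}_1^T, \ldots, \mathbf{A}_N^T]^T$. $\mathbf{F}_N$ denotes the $N \times N$ 
Discrete Fourier Transform (DFT) matrix, and $x^*$ denotes the 
optimal value of $x$. 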

II. System Model 

We consider a block transmission system with one source 
node, S, one relay node, R, and one destination node, D, as 
shown in Fig. 1. The numbers of antennas at S, R, and D 
are denoted by $N_s$, $N_r$, and $N_d$, respectively. The number of 
spatial multiplexing data streams is $M \le \min\{N_s, N_r, N_d\}$. 
The transmission is organized in two phases. In the first phase, 
S processes the information symbols and sends them to R. 
In the second phase, R processes the received signal and 
retransmits it to D. We assume there is no direct link between 
S and D due to the large pathloss and/or shadowing. 

The transmit signal of each source antenna is prepended 
by a cyclic prefix (CP), which comprises the last $N_{g,s} > L_g$ 
symbols of the transmitted source signal, where $L_g$ denotes 
the largest channel impulse response (CIR) length between 
any S-R antenna pair.¹ Similarly, the transmit signal of each 
relay antenna is prepended by a CP, which comprises the last 
$N_{g,r} > L_h$ symbols of the transmitted relay signal, where $L_h$ 
is the largest CIR length between any R-D antenna pair. 

A. Precoding at Source and Relay 

Let us denote the nth source data symbol vector as 
$\mathbf{s}_n = [s_n(1), s_n(2), \ldots, s_n(M)]^T$, $n = 0, \ldots, N_c - 1$, where $N_c$ is 
the size of the data block, and $s_n(j)$ denotes the nth symbol 
of the jth data stream, which is drawn from a constellation 
with variance $\sigma_s^2$. By stacking all $\mathbf{s}_n$ into one vector, we obtain 

¹ For simplicity of presentation, the CP insertion is not shown in Fig. 1. 



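$\mathbf{s} = [\mathbf{s}_0^T, \ldots, \mathbf{s}_{N_c-1}^T]^T \in \mathbb{C}^{M N_c \times 1}$. The received signal at the 
destination, $\mathbf{y}$, can be compactly written as 

$$\mathbf{y} = \mathbf{H}_t \mathbf{A}_t \mathbf{G}_t \mathbf{P}_t \mathbf{s} + \mathbf{H}_t \mathbf{A}_t \mathbf{v} + \mathbf{u} \tag{1}$$

with block circular matrices 

$$\mathbf{P}_t = \mathrm{blkcirc}([\mathbf{P}_{t,0}^T, \ldots, \mathbf{P}_{t,N_c-1}^T]^T),$$
$$\mathbf{G}_t = \mathrm{blkcirc}([\mathbf{G}_{t,0}^T, \ldots, \mathbf{G}_{t,L_g-1}^T, \mathbf{0}_{N_s \times N_r(N_c-L_g)}]^T),$$
$$\mathbf{A}_t = \mathrm{blkcirc}([\mathbf{A}_{t,0}^T, \ldots, \mathbf{A}_{t,N_c-1}^T]^T),$$
$$\mathbf{H}_t = \mathrm{blkcirc}([\mathbf{H}_{t,0}^T, \ldots, \mathbf{H}_{t,L_h-1}^T, \mathbf{0}_{N_r \times N_d(N_c-L_h)}]^T),$$

where $\mathbf{P}_{t,l} \in \mathbb{C}^{N_s \times M}$, $\mathbf{G}_{t,l} \in \mathbb{C}^{N_r \times N_s}$, $\mathbf{A}_{t,l} \in \mathbb{C}^{N_r \times N_r}$, 
and $\mathbf{H}_{t,l} \in \mathbb{C}^{N_d \times N_r}$ denote the lth tap of the time-domain 
(TD) source precoding filter, the S-R channel, the TD relay 
precoding filter, and the R-D channel, respectively. The noise 
vectors at R and D are denoted by 

$$\mathbf{v} = [\mathbf{v}_0^T, \ldots, \mathbf{v}_{N_c-1}^T]^T \sim \mathcal{CN}(\mathbf{0}, \sigma_v^2 \mathbf{I}_{N_r N_c}), \quad \mathbf{u} = [\mathbf{u}_0^T, \ldots, \mathbf{u}_{N_c-1}^T]^T \sim \mathcal{CN}(\mathbf{0}, \sigma_u^2 \mathbf{I}_{N_d N_c}), \tag{2}$$

where $\mathbf{v}_n = [v_n(1), v_n(2), \ldots, v_n(N_r)]^T$ and 
$\mathbf{u}_n = [u_n(1), u_n(2), \ldots, u_n(N_d)]^T$ denote the additive white Gaussian 
noise (AWGN) vectors at R and D at time n, respectively. 
The block circular matrices $\{\mathbf{P}_t, \mathbf{G}_t, \mathbf{A}_t, \mathbf{H}_t\}$ can be decomposed as 

$$\mathbf{P}_t = \mathbf{F}_{N_s}^{\dagger} \mathbf{P}_f \mathbf{F}_M, \quad \mathbf{G}_t = \mathbf{F}_{N_r}^{\dagger} \mathbf{G}_f \mathbf{F}_{N_s}, \quad \mathbf{A}_t = \mathbf{F}_{N_r}^{\dagger} \mathbf{A}_f \mathbf{F}_{N_r}, \quad \mathbf{H}_t = \mathbf{F}_{N_d}^{\dagger} \mathbf{H}_f \mathbf{F}_{N_r}, \tag{3}$$

with $\mathbf{F}_T = \mathbf{F}_{N_c} \otimes \mathbf{I}_T$, $T \in \{M, N_s, N_r, N_d\}$, 
$\mathbf{X}_f = \mathrm{blkdiag}([\mathbf{X}_0^T, \ldots, \mathbf{X}_{N_c-1}^T]^T)$, and $\mathbf{X}_f \in \{\mathbf{P}_f, \mathbf{G}_f, \mathbf{A}_f, \mathbf{H}_f\}$. 
Here, $\mathbf{P}_k \in \mathbb{C}^{N_s \times M}$, $\mathbf{G}_k \in \mathbb{C}^{N_r \times N_s}$, $\mathbf{A}_k \in \mathbb{C}^{N_r \times N_r}$, 
and $\mathbf{H}_k \in \mathbb{C}^{N_d \times N_r}$ represent the frequency-domain (FD) 
source precoding, S-R channel, relay precoding, and R-D 
channel matrices for the kth frequency tone, respectively. 
We define the equivalent end-to-end channel matrix 
$\mathbf{Q}_t = \mathbf{H}_t \mathbf{A}_t \mathbf{G}_t \mathbf{P}_t$ and express it as $\mathbf{Q}_t = \mathbf{F}_{N_d}^{\dagger} \mathbf{Q}_f \mathbf{F}_M$, where 
$\mathbf{Q}_f = \mathrm{blkdiag}([\mathbf{Q}_0^T, \ldots, \mathbf{Q}_{N_c-1}^T]^T)$ with $\mathbf{Q}_k = \mathbf{H}_k \mathbf{A}_k \mathbf{G}_k \mathbf{P}_k \in \mathbb{C}^{N_d \times M}$ 
representing the equivalent S-D channel matrix on 
the kth frequency tone. Furthermore, the CV matrix of the 
equivalent noise vector $\mathbf{n} = \mathbf{H}_t \mathbf{A}_t \mathbf{v} + \mathbf{u}$ can be obtained as 

$$\mathbf{K} = E[\mathbf{n}\mathbf{n}^{\dagger}] = \mathbf{F}_{N_d}^{\dagger} \mathbf{K}_f \mathbf{F}_{N_d}, \tag{4}$$

where $\mathbf{K}_f = \sigma_v^2 \mathbf{H}_f \mathbf{A}_f \mathbf{A}_f^{\dagger} \mathbf{H}_f^{\dagger} + \sigma_u^2 \mathbf{I}_{N_d N_c}$. 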
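
The decomposition in (3) can be checked numerically. The following is a minimal sketch, assuming a unitary DFT matrix with entries $e^{-j2\pi kn/N_c}/\sqrt{N_c}$ and a block circular matrix whose first block column stacks the taps; the helper names `blkcirc` and `dft_kron` and all sizes are illustrative choices, not taken from the paper.

```python
import numpy as np

def blkcirc(blocks):
    """Block circular matrix whose first block column is the stacked `blocks`."""
    n = len(blocks)
    r, c = blocks[0].shape
    out = np.zeros((n * r, n * c), dtype=complex)
    for i in range(n):
        for j in range(n):
            out[i * r:(i + 1) * r, j * c:(j + 1) * c] = blocks[(i - j) % n]
    return out

def dft_kron(n, size):
    """Unitary n-point DFT matrix, Kronecker product with the size x size identity."""
    idx = np.arange(n)
    f = np.exp(-2j * np.pi * np.outer(idx, idx) / n) / np.sqrt(n)
    return np.kron(f, np.eye(size))

rng = np.random.default_rng(0)
n_c, n_r, n_s, l_g = 8, 3, 2, 3  # illustrative sizes (assumption)

# L_g random S-R channel taps, zero-padded up to the block length N_c.
taps = [rng.standard_normal((n_r, n_s)) + 1j * rng.standard_normal((n_r, n_s))
        for _ in range(l_g)]
taps += [np.zeros((n_r, n_s), dtype=complex) for _ in range(n_c - l_g)]

g_t = blkcirc(taps)
# G_t = F_{N_r}^H G_f F_{N_s} implies G_f = F_{N_r} G_t F_{N_s}^H.
g_f = dft_kron(n_c, n_r) @ g_t @ dft_kron(n_c, n_s).conj().T

# Expected block diagonal: G_k is the DFT of the taps at tone k.
g_f_expected = np.zeros_like(g_f)
for k in range(n_c):
    g_k = sum(taps[l] * np.exp(-2j * np.pi * k * l / n_c) for l in range(l_g))
    g_f_expected[k * n_r:(k + 1) * n_r, k * n_s:(k + 1) * n_s] = g_k

max_deviation = np.abs(g_f - g_f_expected).max()
```

Under these assumptions, `max_deviation` is at the level of floating-point rounding, which illustrates why the frequency-selective channel reduces to one flat matrix channel per frequency tone.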

B. Equalization at the Destination 

Fig. 1. System model for a MIMO relay system with SC-FDE at the destination. 

The received signal $\mathbf{y}$ is transformed into the FD using 
$\mathbf{F}_{N_d}$ and equalized by an FD feedforward filter (FFF) 
$\mathbf{W}_f = \mathrm{blkdiag}([\mathbf{W}_0^T, \ldots, \mathbf{W}_{N_c-1}^T]^T)$. The resulting signal is then 
transformed into the TD using $\mathbf{F}_M^{\dagger}$, resulting in 

$$\hat{\mathbf{y}} = \mathbf{W}_t \mathbf{y}, \tag{5}$$

where $\mathbf{W}_t = \mathbf{F}_M^{\dagger} \mathbf{W}_f \mathbf{F}_{N_d}$ is the equivalent TD 
FFF and $\hat{\mathbf{y}} = [\hat{\mathbf{y}}_0^T, \ldots, \hat{\mathbf{y}}_{N_c-1}^T]^T$ with 
$\hat{\mathbf{y}}_n = [\hat{y}_n(1), \hat{y}_n(2), \ldots, \hat{y}_n(M)]^T$ denoting the nth signal vector 
at the output of the FFF. If FD-LE is employed, $\hat{\mathbf{y}}_n$ is the 
decision variable for the nth source symbol vector. On the 
other hand, for FD-DFE, $\hat{\mathbf{y}}_n$ is further processed using a TD 
feedback filter (FBF) to perform interference cancellation. 
Assuming correct feedback at the output of the slicer,² the 
signal corresponding to the mth data stream at time n at the 
input of the slicer is given by 

$$\tilde{y}_n(m) = \hat{y}_n(m) - \sum_{l=0}^{N_{fb}} [\mathbf{B}_{t,l}]_{(m,:)} \mathbf{s}_{(n-l) \bmod N_c}, \tag{6}$$

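where $\mathbf{B}_{t,l}$ denotes the coefficient matrix of the lth tap of the 
FBF, $[\mathbf{X}]_{(m,:)}$ stands for the mth row of matrix $\mathbf{X}$, $N_{fb}$ is the 
number of feedback taps, and $(\cdot) \bmod N$ denotes the modulo-N 
operation. From (6), we observe that at the initial stage of 
the feedback process, i.e., when $n = 0$, $[\mathbf{s}_{N_c-N_{fb}}, \ldots, \mathbf{s}_{N_c-1}]$ 
has to be known a priori, which can be accomplished by 
using known training symbols. Nevertheless, for detection of 
$s_n(m)$, $[s_n(1), \ldots, s_n(m-1)]$ is still unknown. Therefore, 
for causal detection, the 0th tap of the FBF, i.e., $\mathbf{B}_{t,0}$, has 
to be a lower triangular matrix with zero diagonal entries. By 
collecting all $\tilde{y}_n(m)$ into a vector $\tilde{\mathbf{y}} = [\tilde{\mathbf{y}}_0^T, \ldots, \tilde{\mathbf{y}}_{N_c-1}^T]^T$ with 
$\tilde{\mathbf{y}}_n = [\tilde{y}_n(1), \tilde{y}_n(2), \ldots, \tilde{y}_n(M)]^T$, we arrive at 

$$\tilde{\mathbf{y}} = \hat{\mathbf{y}} - \mathbf{B}_t \mathbf{s}, \tag{7}$$

where $\mathbf{B}_t = \mathrm{blkcirc}([\mathbf{B}_{t,0}^T, \ldots, \mathbf{B}_{t,N_{fb}}^T, \mathbf{0}_{M \times M(N_c-N_{fb}-1)}]^T) \in \mathbb{C}^{M N_c \times M N_c}$ 
is the equivalent TD-FBF. Thus, the error vector 
at the input of the slicer can be expressed as 

$$\mathbf{e} = \tilde{\mathbf{y}} - \mathbf{s} = \hat{\mathbf{y}} - (\mathbf{B}_t + \mathbf{I}_{M N_c})\mathbf{s} = \hat{\mathbf{y}} - \mathbf{C}_t \mathbf{s}, \tag{8}$$

where $\mathbf{C}_t = \mathbf{B}_t + \mathbf{I}_{M N_c} = \mathrm{blkcirc}([\mathbf{C}_{t,0}^T, \ldots, \mathbf{C}_{t,N_{fb}}^T, \mathbf{0}^T]^T)$ with 
$\mathbf{C}_{t,n} = \mathbf{B}_{t,n}, \forall n \neq 0$, and $\mathbf{C}_{t,0} = \mathbf{B}_{t,0} + \mathbf{I}_M$. The block circular 
matrix $\mathbf{C}_t$ can be decomposed as $\mathbf{C}_t = \mathbf{F}_M^{\dagger} \mathbf{C}_f \mathbf{F}_M$, where 
$\mathbf{C}_f = \mathrm{blkdiag}([\mathbf{C}_0^T, \ldots, \mathbf{C}_{N_c-1}^T]^T)$. We note that by setting 
$\mathbf{C}_t = \mathbf{I}_{M N_c}$, FD-DFE reduces to FD-LE. 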

III. Optimal Minimum MSE FDE Filter Design 

In this section, we derive the optimal minimum MSE 
equalization filters at the destination and the corresponding 
error CV matrices at the output of the equalizer as functions 
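of the source and relay precoding matrices. Combining (1)-(5) 
and (8), the error CV matrix, $\mathbf{E} = E[\mathbf{e}\mathbf{e}^{\dagger}]$, can be expressed as 

$$\mathbf{E} = \mathbf{F}_M^{\dagger} \left( \mathbf{W}_f (\sigma_s^2 \mathbf{Q}_f \mathbf{Q}_f^{\dagger} + \mathbf{K}_f) \mathbf{W}_f^{\dagger} - \sigma_s^2 \mathbf{W}_f \mathbf{Q}_f \mathbf{C}_f^{\dagger} - \sigma_s^2 \mathbf{C}_f \mathbf{Q}_f^{\dagger} \mathbf{W}_f^{\dagger} + \sigma_s^2 \mathbf{C}_f \mathbf{C}_f^{\dagger} \right) \mathbf{F}_M. \tag{9}$$

² Correct feedback is a common assumption for the design of decision 
feedback equalizers [11]-[13]. 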

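Following the conventional equalization design methodology, 
the optimum FD FFF is obtained by minimizing the sum of 
stream MSEs, $\mathrm{tr}(\mathbf{E})$, which yields 

$$\mathbf{W}_f^* = \sigma_s^2 \mathbf{C}_f \mathbf{Q}_f^{\dagger} (\sigma_s^2 \mathbf{Q}_f \mathbf{Q}_f^{\dagger} + \mathbf{K}_f)^{-1}. \tag{10}$$

Substituting $\mathbf{W}_f^*$ into (9) and simplifying the resulting 
expression, the CV matrix can be rewritten as 

$$\mathbf{E} = \sigma_s^2 \mathbf{F}_M^{\dagger} \mathbf{C}_f \boldsymbol{\Psi}_f^{-1} \mathbf{C}_f^{\dagger} \mathbf{F}_M, \tag{11}$$

where $\boldsymbol{\Psi}_f = \mathrm{blkdiag}([\boldsymbol{\Psi}_0^T, \ldots, \boldsymbol{\Psi}_{N_c-1}^T]^T) \in \mathbb{C}^{M N_c \times M N_c}$ with 

$$\boldsymbol{\Psi}_k = \sigma_s^2 \mathbf{Q}_k^{\dagger} (\sigma_v^2 \mathbf{H}_k \mathbf{A}_k \mathbf{A}_k^{\dagger} \mathbf{H}_k^{\dagger} + \sigma_u^2 \mathbf{I}_{N_d})^{-1} \mathbf{Q}_k + \mathbf{I}_M. \tag{12}$$

From (11), we observe that $\mathbf{E}$ is a block circular matrix. Hence, 
its block diagonal entries, $\mathbf{E}_n \in \mathbb{C}^{M \times M}$, $\forall n$, are identical. 
Since the diagonal entries of $\mathbf{E}_n$ represent the 
MSEs of the different spatial streams at time n, symbols from 
the same stream experience identical MSEs. Dropping the time 
index, the $M \times M$ CV matrix can be conveniently written as 

$$\mathbf{E} = \frac{\sigma_s^2}{N_c} \sum_{k=0}^{N_c-1} \mathbf{C}_k \boldsymbol{\Psi}_k^{-1} \mathbf{C}_k^{\dagger}. \tag{13}$$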
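
The step from (9) and (10) to (11) and (12) rests on the matrix inversion lemma. The following is a minimal per-tone sketch for the FD-LE case ($\mathbf{C}_k = \mathbf{I}_M$), using randomly drawn matrices as stand-ins for $\mathbf{Q}_k$ and $\mathbf{H}_k\mathbf{A}_k$; the sizes, noise variances, and variable names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n_d, m = 4, 2                                  # illustrative sizes (assumption)
sigma_s2, sigma_v2, sigma_u2 = 1.0, 0.1, 0.2   # illustrative variances (assumption)

def crandn(*shape):
    """Complex matrix with i.i.d. standard normal real and imaginary parts."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

q = crandn(n_d, m)     # stand-in for the end-to-end channel Q_k
ha = crandn(n_d, 3)    # stand-in for the product H_k A_k
k_noise = sigma_v2 * ha @ ha.conj().T + sigma_u2 * np.eye(n_d)

# MMSE feedforward filter for C_k = I_M, cf. (10).
w = sigma_s2 * q.conj().T @ np.linalg.inv(sigma_s2 * q @ q.conj().T + k_noise)

# Error covariance computed directly from e = (W Q - I) s + W n.
resid = w @ q - np.eye(m)
e_direct = sigma_s2 * resid @ resid.conj().T + w @ k_noise @ w.conj().T

# Closed form sigma_s^2 Psi_k^{-1}, cf. (11) and (12).
psi = sigma_s2 * q.conj().T @ np.linalg.solve(k_noise, q) + np.eye(m)
e_closed = sigma_s2 * np.linalg.inv(psi)
```

Both expressions agree to rounding error, which is the per-tone content of (11).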

A. CV Matrix and Filter Design for FD-LE 

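Eqs. (10) and (13) are valid for both FD-LE and FD-DFE. 
For the special case of FD-LE, we can set $\mathbf{C}_f = \mathbf{I}_{M N_c}$, which 
leads to 

$$\mathbf{W}_f^* = \sigma_s^2 \mathbf{Q}_f^{\dagger} (\sigma_s^2 \mathbf{Q}_f \mathbf{Q}_f^{\dagger} + \mathbf{K}_f)^{-1} \tag{14}$$

and CV matrix 

$$\mathbf{E}_{\mathrm{FD\text{-}LE}} = \frac{\sigma_s^2}{N_c} \sum_{k=0}^{N_c-1} \boldsymbol{\Psi}_k^{-1}. \tag{15}$$

Interestingly, $\mathbf{E}_{\mathrm{FD\text{-}LE}}$ is equal to the arithmetic mean of the 
subcarrier CV matrices, $\sigma_s^2 \boldsymbol{\Psi}_k^{-1}$, in MIMO-OFDM relay systems 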

m. 
B. CV Matrix and Filter Design for FD-DFE 

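The FD-DFE CV matrix depends on the FD-FBF matrices 
$\mathbf{C}_k$. Since the FBF has to be implemented in the TD, we 
express $\mathbf{C}_k$ in terms of the TD FBF coefficients $\mathbf{C}_{t,n}$ as 
$\mathbf{C}_k = \sum_{n=0}^{N_{fb}} \mathbf{C}_{t,n} e^{-j 2\pi k n / N_c}$. Now, (13) can be rewritten as 

$$\mathbf{E}_{\mathrm{FD\text{-}DFE}} = \frac{\sigma_s^2}{N_c} \sum_{k=0}^{N_c-1} \sum_{n=0}^{N_{fb}} \sum_{m=0}^{N_{fb}} \mathbf{C}_{t,n} \boldsymbol{\Psi}_k^{-1} e^{-j 2\pi k (n-m)/N_c} \mathbf{C}_{t,m}^{\dagger} = \frac{\sigma_s^2}{N_c} \mathbf{C} \mathbf{Z} \mathbf{C}^{\dagger}. \tag{16}$$

To simplify the notation, we have used the definitions 
$\mathbf{C} = [\mathbf{C}_{t,0}, \ldots, \mathbf{C}_{t,N_{fb}}]$ and 

$$\mathbf{Z} = \begin{bmatrix} \mathbf{Z}_0 & \mathbf{Z}_1 & \cdots & \mathbf{Z}_{N_{fb}} \\ \mathbf{Z}_1^{\dagger} & \mathbf{Z}_0 & \cdots & \mathbf{Z}_{N_{fb}-1} \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{Z}_{N_{fb}}^{\dagger} & \mathbf{Z}_{N_{fb}-1}^{\dagger} & \cdots & \mathbf{Z}_0 \end{bmatrix}, \tag{17}$$

where $\mathbf{Z}_n = \sum_{k=0}^{N_c-1} \boldsymbol{\Psi}_k^{-1} e^{j 2\pi k n / N_c}$. The optimal $\mathbf{C}$ minimizing 
$\mathrm{tr}\{\mathbf{E}_{\mathrm{FD\text{-}DFE}}\}$ can be obtained by solving 

$$\min_{\mathbf{C}} \ \mathrm{tr}(\mathbf{C} \mathbf{Z} \mathbf{C}^{\dagger}) \quad \text{s.t.} \quad \mathbf{C} \boldsymbol{\Theta} = \mathbf{C}_{t,0}, \tag{18}$$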

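where $\boldsymbol{\Theta} = [\mathbf{I}_M, \mathbf{0}_{M \times M N_{fb}}]^T$. Problem (18) can be solved 
using the standard Lagrange multiplier method, leading to [13] 

$$\mathbf{C} = \mathbf{C}_{t,0} (\boldsymbol{\Theta}^{\dagger} \mathbf{Z}^{-1} \boldsymbol{\Theta})^{-1} \boldsymbol{\Theta}^{\dagger} \mathbf{Z}^{-1}. \tag{19}$$

By partitioning $\mathbf{Z}$ and $\mathbf{Z}^{-1}$ as 

$$\mathbf{Z} = \begin{bmatrix} \mathbf{Z}_{11} & \mathbf{Z}_{12} \\ \mathbf{Z}_{12}^{\dagger} & \mathbf{Z}_{22} \end{bmatrix}, \quad \mathbf{Z}^{-1} = \begin{bmatrix} \mathbf{U}_{11} & \mathbf{U}_{12} \\ \mathbf{U}_{12}^{\dagger} & \mathbf{U}_{22} \end{bmatrix}, \tag{20}$$

where $\{\mathbf{Z}_{11}, \mathbf{U}_{11}\} \in \mathbb{C}^{M \times M}$, $\{\mathbf{Z}_{12}, \mathbf{U}_{12}\} \in \mathbb{C}^{M \times M N_{fb}}$, and 
$\{\mathbf{Z}_{22}, \mathbf{U}_{22}\} \in \mathbb{C}^{M N_{fb} \times M N_{fb}}$, $\mathbf{C}$ can be further expressed as 

$$\mathbf{C} = \mathbf{C}_{t,0} [\mathbf{I}_M, \ -\mathbf{Z}_{12} \mathbf{Z}_{22}^{-1}]. \tag{21}$$

Substituting (21) into (16), the FD-DFE CV matrix can be 
rewritten as 

$$\mathbf{E}_{\mathrm{FD\text{-}DFE}} = \frac{\sigma_s^2}{N_c} \mathbf{C}_{t,0} \mathbf{U}_{11}^{-1} \mathbf{C}_{t,0}^{\dagger}. \tag{22}$$

To complete the FBF design, the optimal $\mathbf{C}_{t,0}$ has to be determined. 
To this end, we introduce the Cholesky decomposition 
of $\mathbf{U}_{11}^{-1}$ as 

$$\mathbf{U}_{11}^{-1} = \mathbf{L} \mathbf{D} \mathbf{L}^{\dagger}, \tag{23}$$

where $\mathbf{L}$ is a unit-diagonal lower triangular matrix and $\mathbf{D}$ is 
a diagonal matrix with positive main diagonal entries. Now, 
it is easy to verify that the optimal $\mathbf{C}_{t,0}$ which minimizes 
$\mathrm{tr}(\mathbf{E}_{\mathrm{FD\text{-}DFE}})$ is given by $\mathbf{C}_{t,0}^* = \mathbf{L}^{-1}$. Hence, the optimal $\mathbf{C}$ 
is obtained as 

$$\mathbf{C}^* = [\mathbf{L}^{-1}, \ -\mathbf{L}^{-1} \mathbf{Z}_{12} \mathbf{Z}_{22}^{-1}]. \tag{24}$$

The structure of the optimal FBF can be interpreted as follows: 
$\mathbf{L}^{-1} \in \mathbb{C}^{M \times M}$ is a lower-triangular matrix which cancels the 
inter-antenna interference (IAI) in the current time slot, and the 
remaining FBF coefficients, $-\mathbf{L}^{-1} \mathbf{Z}_{12} \mathbf{Z}_{22}^{-1} \in \mathbb{C}^{M \times M N_{fb}}$, cancel 
both the IAI and inter-symbol interference (ISI) stemming 
from the previous $N_{fb}$ time slots. Inserting $\mathbf{C}^*$ into (16), 
the error CV matrix can be written as 

$$\mathbf{E}_{\mathrm{FD\text{-}DFE}} = \frac{\sigma_s^2}{N_c} \mathbf{D}. \tag{25}$$

Interestingly, unlike for FD-LE, the error CV matrix for FD-DFE 
is a diagonal matrix, which also depends on the number 
of FBF taps $N_{fb}$. 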
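
The role of the decomposition in (23) can be illustrated numerically. The sketch below is only an illustration under stated assumptions: a random Hermitian positive definite matrix stands in for $\mathbf{U}_{11}^{-1}$, and the unit-diagonal factor $\mathbf{L}$ and the diagonal $\mathbf{D}$ are obtained by rescaling the standard Cholesky factor returned by NumPy.

```python
import numpy as np

rng = np.random.default_rng(2)
m = 3  # number of spatial streams, illustrative (assumption)

# Random Hermitian positive definite stand-in for U_11^{-1}.
a = rng.standard_normal((m, m)) + 1j * rng.standard_normal((m, m))
u11_inv = a @ a.conj().T + m * np.eye(m)

# Standard Cholesky factor g (lower triangular, real positive diagonal),
# with u11_inv = g g^H.
g = np.linalg.cholesky(u11_inv)
d = np.real(np.diag(g)) ** 2      # diagonal entries of D
l_mat = g / np.diag(g)            # divide column j by g[j, j]: unit-diagonal L

# Choosing the 0th feedback tap as C_{t,0} = L^{-1} diagonalizes the error CV.
c_t0 = np.linalg.inv(l_mat)
e_dfe = c_t0 @ u11_inv @ c_t0.conj().T

# Reference: no feedback on the current slot (C_{t,0} = I) gives tr(U_11^{-1}).
trace_without_iai_cancellation = np.real(np.trace(u11_inv))
trace_with_iai_cancellation = d.sum()
```

Here `e_dfe` equals `np.diag(d)` up to rounding, and the trace with $\mathbf{C}_{t,0} = \mathbf{L}^{-1}$ does not exceed the trace obtained with $\mathbf{C}_{t,0} = \mathbf{I}_M$, consistent with the diagonal structure in (25).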

IV. Source and Relay Precoding Matrix 
Optimization 

Exploiting the expressions for the error CV matrix obtained 
in the previous section, we now minimize a general 
function $f(\mathrm{diag}[\mathbf{E}])$ of the spatial stream MSEs at the output of 
the equalization filter under separate constraints on the powers 
consumed at the source and the relay.³ Mathematically, the 
optimization problem is stated as 

$$\min_{\{\mathbf{P}_k, \mathbf{A}_k\}} f(\mathrm{diag}[\mathbf{E}]) \quad \text{s.t.} \quad \mathrm{tr}(E[\mathbf{x}\mathbf{x}^{\dagger}]) \le P_S, \quad \mathrm{tr}(E[\mathbf{t}\mathbf{t}^{\dagger}]) \le P_R, \tag{26}$$

where $\mathbf{E} = \mathbf{E}_{\mathrm{FD\text{-}LE}}$ and $\mathbf{E} = \mathbf{E}_{\mathrm{FD\text{-}DFE}}$ for FD-LE and FD-DFE, 
respectively, $P_S$ and $P_R$ are the power budgets for S 
and R, respectively, and $\mathrm{diag}[\mathbf{M}]$ denotes a vector containing 
the main diagonal entries of matrix $\mathbf{M}$. The objective 
function $f(\mathrm{diag}[\mathbf{E}])$ can be either a Schur-convex or a Schur-concave 
increasing function with respect to (w.r.t.) $\mathrm{diag}[\mathbf{E}]$ 
[14]. For concreteness, in this paper, we consider the three 
most important objective functions of this type, namely the 
arithmetic MSE (AMSE), the geometric MSE (GMSE), and 
the maximum MSE (maxMSE) 

$$f(\mathrm{diag}[\mathbf{E}]) = \begin{cases} \sum_{m=1}^{M} E_{m,m}, & \text{AMSE} \\ \prod_{m=1}^{M} E_{m,m}, & \text{GMSE} \\ \max_{m} E_{m,m}, & \text{maxMSE} \end{cases} \tag{27}$$

where $E_{m,m}$ denotes the mth diagonal entry of $\mathbf{E}$. The AMSE 
and GMSE are Schur-concave functions while the maxMSE 
is a Schur-convex function w.r.t. $\mathrm{diag}[\mathbf{E}]$ [14]. We note that 
similar objective functions have been considered for MIMO-OFDM 
based relay systems in [6]. However, for MIMO-OFDM 
based relay systems, the AMSE, GMSE, and maxMSE 
are the sum, product, and maximum of the subcarrier MSEs of 
different spatial sub-streams. In contrast, in (27), these three 
quantities are the sum, product, and maximum of the sub-stream 
MSEs of a single carrier. 
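
The three criteria in (27) are simple functions of the vector of stream MSEs. The following sketch evaluates them for two hypothetical MSE vectors with the same sum; the function name `objective` and the numerical values are illustrative assumptions.

```python
import numpy as np

def objective(mse, criterion):
    """Evaluate one of the criteria in (27) for a vector of stream MSEs."""
    mse = np.asarray(mse, dtype=float)
    if criterion == "AMSE":
        return float(mse.sum())
    if criterion == "GMSE":
        return float(mse.prod())
    if criterion == "maxMSE":
        return float(mse.max())
    raise ValueError(f"unknown criterion: {criterion}")

uneven = [0.1, 0.2, 0.4]      # hypothetical stream MSEs
even = [0.7 / 3.0] * 3        # same sum, spread equally over the streams

amse_uneven = objective(uneven, "AMSE")
gmse_uneven = objective(uneven, "GMSE")
max_uneven = objective(uneven, "maxMSE")
```

Equalizing the MSEs leaves the sum unchanged, lowers the maximum, and raises the product, which matches the Schur-convex behavior of the maxMSE and the Schur-concave behavior of the GMSE.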



³ We note that our derivations can be extended to a joint source and relay 
power constraint. While such a joint power constraint offers more degrees 
of freedom for the system design, separate power constraints appear more 
practical since usually the source node and the relay node have their own 
power supplies. 
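The power consumptions at source and relay are given by 

$$\mathrm{tr}(E[\mathbf{x}\mathbf{x}^{\dagger}]) = \sigma_s^2 \sum_{k=0}^{N_c-1} \mathrm{tr}(\mathbf{P}_k \mathbf{P}_k^{\dagger}), \quad \mathrm{tr}(E[\mathbf{t}\mathbf{t}^{\dagger}]) = \sum_{k=0}^{N_c-1} \mathrm{tr}\left(\mathbf{A}_k (\sigma_s^2 \mathbf{G}_k \mathbf{P}_k \mathbf{P}_k^{\dagger} \mathbf{G}_k^{\dagger} + \sigma_v^2 \mathbf{I}_{N_r}) \mathbf{A}_k^{\dagger}\right). \tag{28}$$

Since the optimization variables in (26) are matrices, solving 
the problem directly would incur high complexity. In the 
following, we will first derive the structure of the optimal 
precoding matrices. Knowing this structure will allow us 
to transform the optimization problem into an optimization 
problem with scalar variables. 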

A. Structure of the Optimal Precoding Matrices for FD-LE 

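We first derive the structure of the optimal source and 
relay precoding matrices for FD-LE. We begin by introducing 
the following singular-value decompositions (SVDs) of the 
channel matrices 

$$\mathbf{G}_k = \mathbf{U}_G^{(k)} \boldsymbol{\Lambda}_G^{(k)} \mathbf{V}_G^{(k)\dagger}, \quad \mathbf{H}_k = \mathbf{U}_H^{(k)} \boldsymbol{\Lambda}_H^{(k)} \mathbf{V}_H^{(k)\dagger}, \quad \forall k, \tag{29}$$

where $\mathbf{U}_G^{(k)} \in \mathbb{C}^{N_r \times N_r}$, $\mathbf{V}_G^{(k)} \in \mathbb{C}^{N_s \times N_s}$ and 
$\mathbf{U}_H^{(k)} \in \mathbb{C}^{N_d \times N_d}$, $\mathbf{V}_H^{(k)} \in \mathbb{C}^{N_r \times N_r}$ are the singular-vector matrices 
of $\mathbf{G}_k$ and $\mathbf{H}_k$, respectively. Furthermore, $\boldsymbol{\Lambda}_G^{(k)} \in \mathbb{C}^{N_r \times N_s}$ 
and $\boldsymbol{\Lambda}_H^{(k)} \in \mathbb{C}^{N_d \times N_r}$ are the singular-value matrices of $\mathbf{G}_k$ 
and $\mathbf{H}_k$, respectively, and both have increasing main diagonal 
elements. 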

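Theorem 1: For the optimization problem in (26), the following 
structures of $\mathbf{P}_k$ and $\mathbf{A}_k$ are optimal 

$$\mathbf{P}_k = \bar{\mathbf{V}}_G^{(k)} \boldsymbol{\Lambda}_P^{(k)} \mathbf{V}_0, \quad \mathbf{A}_k = \bar{\mathbf{V}}_H^{(k)} \boldsymbol{\Lambda}_A^{(k)} \bar{\mathbf{U}}_G^{(k)\dagger}, \quad \forall k, \tag{30}$$

where $\bar{\mathbf{V}}_G^{(k)}$, $\bar{\mathbf{U}}_G^{(k)}$, and $\bar{\mathbf{V}}_H^{(k)}$ contain the M right-most 
columns of $\mathbf{V}_G^{(k)}$, $\mathbf{U}_G^{(k)}$, and $\mathbf{V}_H^{(k)}$, respectively. $\boldsymbol{\Lambda}_P^{(k)}$ and $\boldsymbol{\Lambda}_A^{(k)}$ 
are $M \times M$ diagonal matrices with the mth diagonal element 
denoted by $p_{km}$ and $a_{km}$, respectively. For Schur-concave 
functions, $\mathbf{V}_0 = \mathbf{I}_M$. For Schur-convex functions, $\mathbf{V}_0$ is a 
unitary matrix chosen in such a way that all main diagonal 
entries of $\mathbf{E}$ are equal.⁴ 

Proof: Please refer to the Appendix. ■ 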
Theorem 1 implies that for Schur-concave functions, the 
source and relay precoding matrices jointly diagonalize the 
MIMO relay channels at each frequency tone, while for 
Schur-convex functions, the precoding matrices diagonalize 
the channels up to a unitary rotation at the source. Therefore, 
the original optimization problem involving matrix variables 
can be transformed into a scalar power optimization problem 
across different spatial beams and frequency tones. 

B. Transformation of Optimization Problem for FD-LE 

⁴ In practice, $\mathbf{V}_0$ can be chosen as a DFT matrix or a Hadamard matrix 
with appropriate dimensions. 

Since the maxMSE is a Schur-convex function, according to 
Theorem 1, the unitary matrix, $\mathbf{V}_0$, should be chosen to make 
all diagonal entries of $\mathbf{E}$ equal. Recall from Section III that 
$\mathrm{diag}[\mathbf{E}]$ represents the MSE of different spatial streams and all 
symbols of a particular stream have the same MSE. This means 
that for maxMSE, identical MSE is achieved for all symbols 
in the SC-FDE system. Hence, the remaining maxMSE power 
allocation problem is identical to that for the AMSE criterion. 
The only difference between the solutions for maxMSE and 
AMSE minimization lies in the choice of $\mathbf{V}_0$. We note that this 
is not true for MIMO-OFDM relay systems, where the unitary 
transformation at the source only achieves identical spatial 
MSEs on each subcarrier, while the MSEs across subcarriers 
are in general different. To balance these MSEs, multilevel 
waterfilling has to be carried out in such MIMO-OFDM relay 
systems, which entails a much higher complexity compared to 
the single-level waterfilling required for the AMSE criterion, 
cf. HI. Additionally, for MIMO-OFDM relay systems, the 
unitary rotation matrices are in general different on each 
subcarrier as the number of transmitted data streams may 
vary from subcarrier to subcarrier. However, for SC-FDE, the 
rotation matrices are identical for all frequency tones since the 
number of data streams is determined in the time domain. 
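
The effect of the unitary rotation for Schur-convex criteria can be illustrated as follows. This is a minimal sketch, assuming that the error CV matrix takes the form $\mathbf{V}_0^{\dagger} \boldsymbol{\Delta} \mathbf{V}_0$ with a diagonal $\boldsymbol{\Delta}$ of unrotated stream MSEs and that $\mathbf{V}_0$ is the unitary DFT matrix; the MSE values are hypothetical.

```python
import numpy as np

m = 4  # number of spatial streams, illustrative (assumption)
idx = np.arange(m)
# Unitary M-point DFT matrix used as the rotation V_0.
v0 = np.exp(-2j * np.pi * np.outer(idx, idx) / m) / np.sqrt(m)

# Hypothetical unrotated stream MSEs (diagonal error CV matrix for V_0 = I).
stream_mse = np.array([0.05, 0.1, 0.2, 0.4])

# Error CV matrix after rotation: V_0^H diag(stream_mse) V_0.
e_rot = v0.conj().T @ np.diag(stream_mse) @ v0
rotated_mse = np.real(np.diag(e_rot))
```

Because every entry of the DFT matrix has magnitude $1/\sqrt{M}$, each rotated diagonal entry equals the arithmetic mean of the unrotated MSEs, so the maximum MSE coincides with the AMSE divided by $M$.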

Because of the equivalence of the power allocation problems 
for maxMSE and AMSE, in the following, we focus on the 
power allocation problem for the AMSE and GMSE criteria. 
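From (12) and (30), we obtain $\boldsymbol{\Psi}_k = \mathbf{V}_0^{\dagger} \boldsymbol{\Phi}_k \mathbf{V}_0$ with 

$$\boldsymbol{\Phi}_k = \sigma_s^2 \boldsymbol{\Lambda}_P^{(k)2} \bar{\boldsymbol{\Lambda}}_G^{(k)2} \boldsymbol{\Lambda}_A^{(k)2} \bar{\boldsymbol{\Lambda}}_H^{(k)2} \left( \sigma_v^2 \boldsymbol{\Lambda}_A^{(k)2} \bar{\boldsymbol{\Lambda}}_H^{(k)2} + \sigma_u^2 \mathbf{I}_M \right)^{-1} + \mathbf{I}_M, \tag{31}$$

where $\bar{\boldsymbol{\Lambda}}_G^{(k)}$ and $\bar{\boldsymbol{\Lambda}}_H^{(k)}$ are diagonal matrices whose diagonal 
entries contain the M largest singular values of $\mathbf{G}_k$ and $\mathbf{H}_k$, 
respectively. Now, we can rewrite the objective functions as 

$$f_X(\boldsymbol{\Phi}) = \begin{cases} \sum_{m=1}^{M} \frac{1}{N_c} \sum_{k=0}^{N_c-1} \Phi_{km}^{-1}, & X = \text{AMSE} \\ \sum_{m=1}^{M} \log_2 \left( \frac{1}{N_c} \sum_{k=0}^{N_c-1} \Phi_{km}^{-1} \right), & X = \text{GMSE} \end{cases} \tag{32}$$

where $\boldsymbol{\Phi} = \{\Phi_{km}, \forall k, m\}$ with 

$$\Phi_{km} = \frac{\sigma_s^2 p_{km}^2 g_{km}^2 a_{km}^2 h_{km}^2}{\sigma_v^2 a_{km}^2 h_{km}^2 + \sigma_u^2} + 1. \tag{33}$$

Here, $g_{km}$ and $h_{km}$ denote the mth main diagonal elements of 
$\bar{\boldsymbol{\Lambda}}_G^{(k)}$ and $\bar{\boldsymbol{\Lambda}}_H^{(k)}$, respectively, and represent the corresponding 
channel gains of the mth spatial stream on the kth frequency 
tone. Note that, for the GMSE criterion, we have taken 
the logarithm of the original objective function to facilitate 
the subsequent optimization. Due to the monotonicity of the 
logarithm, the new objective function has the same optimal 
solution as the original one. The new objective function can 
be rewritten as 

$$f_{\mathrm{GMSE}}(\boldsymbol{\Phi}) = -\sum_{m=1}^{M} \log_2 (\mathrm{SINR}_m + 1), \tag{34}$$

where $\mathrm{SINR}_m = \left( \frac{1}{N_c} \sum_{k=0}^{N_c-1} \Phi_{km}^{-1} \right)^{-1} - 1$ is the 
signal-to-interference-plus-noise ratio (SINR) of the mth data stream. 
This implies that (34) is essentially the negative sum of 
the channel capacities of different spatial streams. Therefore, 
minimization of the GMSE is equivalent to the maximization 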
minimization of the GMSE is equivalent to the maximization 
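of the capacity of the considered MIMO SC-FDE relay system. 
By exploiting (30), the expression for the power consumption 
on the left hand side of the constraints in (26) can be expressed 
as 

$$\mathrm{tr}(E[\mathbf{x}\mathbf{x}^{\dagger}]) = \sum_{k=0}^{N_c-1} \mathrm{tr}\left(\sigma_s^2 \boldsymbol{\Lambda}_P^{(k)2}\right) = \sum_{k=0}^{N_c-1} \sum_{m=1}^{M} P_{s,km},$$
$$\mathrm{tr}(E[\mathbf{t}\mathbf{t}^{\dagger}]) = \sum_{k=0}^{N_c-1} \mathrm{tr}\left(\boldsymbol{\Lambda}_A^{(k)2} \left(\sigma_s^2 \boldsymbol{\Lambda}_P^{(k)2} \bar{\boldsymbol{\Lambda}}_G^{(k)2} + \sigma_v^2 \mathbf{I}_M\right)\right) = \sum_{k=0}^{N_c-1} \sum_{m=1}^{M} P_{r,km}, \tag{35}$$

where 

$$P_{s,km} = \sigma_s^2 p_{km}^2, \quad P_{r,km} = a_{km}^2 (\sigma_s^2 p_{km}^2 g_{km}^2 + \sigma_v^2) \tag{36}$$

can be interpreted as the power allocated to the kth frequency 
tone and the mth spatial stream at the source and the relay, 
respectively. By rewriting $\Phi_{km}$ in terms of the newly introduced 
variables $P_{s,km}$ and $P_{r,km}$ as 

$$\Phi_{km} = \frac{P_{s,km} P_{r,km} g_{km}^2 h_{km}^2}{\sigma_v^2 P_{r,km} h_{km}^2 + \sigma_u^2 P_{s,km} g_{km}^2 + \sigma_u^2 \sigma_v^2} + 1, \tag{37}$$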
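
The change of variables from the amplitudes $(p_{km}, a_{km})$ to the powers $(P_{s,km}, P_{r,km})$ can be verified for a single tone and stream. The sketch below is illustrative: the function names and all numerical values are assumptions chosen only to exercise (33), (36), and (37).

```python
import numpy as np

def phi_from_gains(p, a, g, h, sigma_s2, sigma_v2, sigma_u2):
    """Phi_km written in terms of the precoder amplitudes, cf. (33)."""
    num = sigma_s2 * p**2 * g**2 * a**2 * h**2
    return num / (sigma_v2 * a**2 * h**2 + sigma_u2) + 1.0

def phi_from_powers(p_s, p_r, g, h, sigma_v2, sigma_u2):
    """Phi_km written in terms of the allocated powers, cf. (37)."""
    num = p_s * p_r * g**2 * h**2
    den = sigma_v2 * p_r * h**2 + sigma_u2 * p_s * g**2 + sigma_u2 * sigma_v2
    return num / den + 1.0

# Hypothetical amplitudes, channel gains, and variances for one (k, m) pair.
p, a, g, h = 0.8, 1.3, 0.9, 1.1
sigma_s2, sigma_v2, sigma_u2 = 1.0, 0.1, 0.2

# Powers at the source and the relay according to (36).
p_s = sigma_s2 * p**2
p_r = a**2 * (sigma_s2 * p**2 * g**2 + sigma_v2)

phi_a = phi_from_gains(p, a, g, h, sigma_s2, sigma_v2, sigma_u2)
phi_b = phi_from_powers(p_s, p_r, g, h, sigma_v2, sigma_u2)
```

Both expressions return the same value, so the scalar problem can be stated entirely in the power variables, which enter the constraints linearly.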



problem (26) can be reformulated as the following power 
allocation problem 

$$\min_{\{P_{s,km}, P_{r,km}\}} f_X(\boldsymbol{\Phi}) \quad \text{s.t.} \quad \sum_{k=0}^{N_c-1} \sum_{m=1}^{M} P_{s,km} \le P_S, \quad \sum_{k=0}^{N_c-1} \sum_{m=1}^{M} P_{r,km} \le P_R, \quad P_{s,km} \ge 0, \ P_{r,km} \ge 0, \ \forall k, m, \tag{38}$$

where the constraints $P_{s,km} \ge 0$, $P_{r,km} \ge 0$, $\forall k, m$, ensure 
that the allocated powers are not negative. 

C. Structure of the Optimal Precoding Matrices for FD-DFE 

For the FD-DFE receiver, we observe from (25) that 
$\mathbf{E}_{\mathrm{FD\text{-}DFE}}$ is not an explicit function of optimization variables 
$\mathbf{P}_k$ and $\mathbf{A}_k$, which renders the optimization a challenging task. 
In the following, we will show that by using some proper 
transformations, an upper bound for the original objective function 
can be derived, which is equivalent to one of the objective 
functions considered for the FD-LE receiver. However, first 
we will show that for FD-DFE, the three considered objective 
functions are equivalent. 

1) Equivalence of Objective Functions: Since $\mathbf{E}_{\mathrm{FD\text{-}DFE}}$ 
in (25) is a diagonal matrix, we invoke the following matrix 
arithmetic-geometric mean inequality 

$$\frac{1}{M} \mathrm{tr}(\mathbf{D}) = \frac{1}{M} \sum_{i=1}^{M} [\mathbf{D}]_{(i,i)} \ge \left( \prod_{i=1}^{M} [\mathbf{D}]_{(i,i)} \right)^{1/M} = \det(\mathbf{D})^{1/M}, \tag{39}$$

where equality holds if and only if (i.f.f.) all main diagonal 
elements of $\mathbf{D}$ are equal. The inequality provides some important 
insights into the objective function for FD-DFE. First, it 
implies that the AMSE, i.e., $\mathrm{tr}(\mathbf{D})$, is lower bounded by the term 
involving the GMSE, i.e., $\det(\mathbf{D})$. Second, this lower bound is 
achieved i.f.f. the MSEs of all streams are identical. Therefore, 
making the diagonal entries of $\mathbf{D}$ identical will enable us to 
minimize the AMSE, GMSE, and maxMSE simultaneously. 
Consequently, for FD-DFE, the three considered objective 
functions become equivalent if this condition is fulfilled, and 
the value of the objective function is equal to $\det(\mathbf{D})^{1/M}$. In 
the sequel, we will show how this can be achieved by properly 
choosing a unitary matrix at the source precoder. By noting 
that 

$$\det(\mathbf{D}) = \det(\mathbf{L}\mathbf{D}\mathbf{L}^{\dagger}) = \det(\mathbf{U}_{11}^{-1}), \tag{40}$$

where we have exploited the properties that $\det(\mathbf{A}\mathbf{B}) = \det(\mathbf{A})\det(\mathbf{B})$ 
and $\det(\mathbf{L}) = 1$ [18], we can obtain 

$$\mathbf{U}_{11}^{-1} = \mathbf{L}\mathbf{D}^{1/2} (\mathbf{L}\mathbf{D}^{1/2})^{\dagger} = (\mathbf{Q}\mathbf{R})^{\dagger} \mathbf{Q}\mathbf{R}, \tag{41}$$



where $\mathbf{Q}$ is an arbitrary unitary matrix of appropriate dimension 
and $\mathbf{R} = (\mathbf{L}\mathbf{D}^{1/2})^{\dagger}$ is an upper triangular matrix whose 
main diagonal elements are equal to the square root of the 
main diagonal elements of $\mathbf{D}$. Therefore, finding a matrix $\mathbf{D}$ 
with equal diagonal elements is equivalent to finding a matrix 
$\mathbf{R}$ with equal diagonal elements. Since it follows from (41) 
that $\mathbf{U}_{11}^{-1/2} = \mathbf{Q}\mathbf{R}$, we have to find a matrix decomposition 
of $\mathbf{U}_{11}^{-1/2}$ such that $\mathbf{R}$ has identical diagonal elements. Such a 
decomposition is referred to as equal-diagonal QR decomposition 
(E-QRD) [16] or geometric-mean decomposition (GMD) 
[17]. Note that for an arbitrary matrix $\mathbf{H}$, the standard form 
of the GMD is 

$$\mathbf{H} \mathbf{V}_1^{\dagger} = \mathbf{Q}\mathbf{R}, \tag{42}$$

where $\mathbf{V}_1$ is a unitary matrix chosen to make the diagonal 
entries of $\mathbf{R}$ all equal. Therefore, to facilitate the application 
of GMD, we rewrite $\mathbf{U}_{11}^{-1/2}$ as 

$$\mathbf{U}_{11}^{-1/2} = \mathbf{H} \mathbf{V}_1^{\dagger}. \tag{43}$$



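In the following, we will find the explicit expression for $\mathbf{H}$ to 
perform the decomposition in (42). By expressing $\mathbf{P}_k$ as the 
product of a general matrix $\tilde{\mathbf{P}}_k$ and $\mathbf{V}_1^{\dagger}$, 

$$\mathbf{P}_k = \tilde{\mathbf{P}}_k \mathbf{V}_1^{\dagger}, \tag{44}$$

we can write $\boldsymbol{\Psi}_k$ as $\boldsymbol{\Psi}_k = \mathbf{V}_1 \tilde{\boldsymbol{\Psi}}_k \mathbf{V}_1^{\dagger}$, where 

$$\tilde{\boldsymbol{\Psi}}_k = \sigma_s^2 \tilde{\mathbf{P}}_k^{\dagger} \mathbf{G}_k^{\dagger} \mathbf{A}_k^{\dagger} \mathbf{H}_k^{\dagger} (\sigma_v^2 \mathbf{H}_k \mathbf{A}_k \mathbf{A}_k^{\dagger} \mathbf{H}_k^{\dagger} + \sigma_u^2 \mathbf{I}_{N_d})^{-1} \mathbf{H}_k \mathbf{A}_k \mathbf{G}_k \tilde{\mathbf{P}}_k + \mathbf{I}_M. \tag{45}$$

Note that $\tilde{\boldsymbol{\Psi}}_k$ has the same form as $\boldsymbol{\Psi}_k$ in (12) but with $\mathbf{P}_k$ 
replaced by $\tilde{\mathbf{P}}_k$. Therefore, matrix $\mathbf{Z}$ in (17) can be written as 

$$\mathbf{Z} = \begin{bmatrix} \mathbf{V}_1 \tilde{\mathbf{Z}}_0 \mathbf{V}_1^{\dagger} & \mathbf{V}_1 \tilde{\mathbf{Z}}_1 \mathbf{V}_1^{\dagger} & \cdots & \mathbf{V}_1 \tilde{\mathbf{Z}}_{N_{fb}} \mathbf{V}_1^{\dagger} \\ \mathbf{V}_1 \tilde{\mathbf{Z}}_1^{\dagger} \mathbf{V}_1^{\dagger} & \mathbf{V}_1 \tilde{\mathbf{Z}}_0 \mathbf{V}_1^{\dagger} & \cdots & \mathbf{V}_1 \tilde{\mathbf{Z}}_{N_{fb}-1} \mathbf{V}_1^{\dagger} \\ \vdots & \vdots & \ddots & \vdots \\ \mathbf{V}_1 \tilde{\mathbf{Z}}_{N_{fb}}^{\dagger} \mathbf{V}_1^{\dagger} & \mathbf{V}_1 \tilde{\mathbf{Z}}_{N_{fb}-1}^{\dagger} \mathbf{V}_1^{\dagger} & \cdots & \mathbf{V}_1 \tilde{\mathbf{Z}}_0 \mathbf{V}_1^{\dagger} \end{bmatrix} = (\mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1) \tilde{\mathbf{Z}} (\mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1^{\dagger}), \tag{46}$$

where $\tilde{\mathbf{Z}}$ has the same form as $\mathbf{Z}$ in (17) with $\mathbf{Z}_n$ replaced by 
$\tilde{\mathbf{Z}}_n$, which is obtained from $\mathbf{Z}_n$ by replacing $\boldsymbol{\Psi}_k$ with $\tilde{\boldsymbol{\Psi}}_k$. By noting that 

$$\mathbf{Z}^{-1} = (\mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1) \tilde{\mathbf{Z}}^{-1} (\mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1^{\dagger}), \tag{47}$$

where we have used $(\mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1)^{-1} = \mathbf{I}_{N_{fb}+1} \otimes \mathbf{V}_1^{\dagger}$, we obtain 
from (20) 

$$\mathbf{U}_{11}^{-1/2} = \tilde{\mathbf{U}}_{11}^{-1/2} \mathbf{V}_1^{\dagger}, \tag{48}$$

where $\tilde{\mathbf{U}}_{11}$ is the first $M \times M$ submatrix of $\tilde{\mathbf{Z}}^{-1}$. Comparing 
(48) with (43), we observe that $\tilde{\mathbf{U}}_{11}^{-1/2}$ is the explicit form 
for $\mathbf{H}$. Hence, for a given $\tilde{\mathbf{U}}_{11}$, we can always find a unitary 
matrix $\mathbf{V}_1$ which achieves the MSE lower bound. 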

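As $\mathbf{U}_{11}$ is a function of the relay precoding matrix as well 
as the remaining part of the source precoding matrix, i.e., 
$\tilde{\mathbf{P}}_k$, in the following, we need to determine these matrices. 
To this end, we write $\mathbf{U}_{11} = (\mathbf{Z}_{11} - \mathbf{Z}_{12} \mathbf{Z}_{22}^{-1} \mathbf{Z}_{12}^{\dagger})^{-1}$ [18], 
which allows us to express the objective function for FD-DFE 
as 

$$\mathrm{OBJ} = \det(\mathbf{U}_{11}^{-1}) = \det\left(\mathbf{Z}_{11} - \mathbf{Z}_{12} \mathbf{Z}_{22}^{-1} \mathbf{Z}_{12}^{\dagger}\right). \tag{49}$$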



2) Upper Bound on Objective Function: Unfortunately, the 
expression for OBJ in (49) depends on the FBF length $N_{fb}$, 
cf. (46), which is not desirable in practice. Additionally, due 
to the presence of $\mathbf{Z}_{22}^{-1}$ in (49), it is also not straightforward to 
express the objective function in terms of $\mathbf{A}_k$ and $\tilde{\mathbf{P}}_k$. To avoid 
these problems, we derive an upper bound for OBJ, which is 
independent of $N_{fb}$ and directly related to the optimization 
variables. 

Since matrix $\mathbf{Z}$ in (17) is a positive semidefinite (PSD) 
matrix, $\mathbf{Z}^{-1}$, $\mathbf{Z}_{22}$, and $\mathbf{U}_{11}$ are PSD matrices as well. Thus, 
$\mathbf{U}_{11}^{-1}$ and $\mathbf{Z}_{12} \mathbf{Z}_{22}^{-1} \mathbf{Z}_{12}^{\dagger}$ are also PSD matrices. By exploiting 
the fact that $\det(\mathbf{A} + \mathbf{B}) \ge \det(\mathbf{A})$ if $\mathbf{A}$ and $\mathbf{B}$ are PSD 
matrices [18], we obtain 

$$\det(\mathbf{U}_{11}^{-1}) \le \det(\mathbf{Z}_{11}), \tag{50}$$

where equality holds i.f.f. $N_{fb} = 0$. Therefore, for the case of 
$N_{fb} = 0$, $\det(\mathbf{Z}_{11})$ is the exact value of OBJ. Otherwise, it 
is an upper bound for OBJ, which can be expressed as 

$$\mathrm{OBJ}_{ub} = \det(\mathbf{Z}_{11}) = \det\left( \sum_{k=0}^{N_c-1} \boldsymbol{\Psi}_k^{-1} \right) = \det\left( \sum_{k=0}^{N_c-1} \tilde{\boldsymbol{\Psi}}_k^{-1} \right), \tag{51}$$

where we exploited $|\det(\mathbf{V}_1)| = 1$. 
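
The bound in (50) follows from the Schur complement of $\mathbf{Z}_{22}$ in $\mathbf{Z}$ and can be illustrated numerically. The sketch below uses a generic random Hermitian positive definite matrix in place of the block Toeplitz matrix in (17), which is an assumption made only to keep the example short; the partition sizes follow (20).

```python
import numpy as np

rng = np.random.default_rng(3)
m, n_fb = 2, 3               # illustrative sizes (assumption)
n = m * (n_fb + 1)

# Generic Hermitian positive definite stand-in for Z.
a = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
z = a @ a.conj().T + np.eye(n)

# Partition as in (20).
z11, z12, z22 = z[:m, :m], z[:m, m:], z[m:, m:]

# Schur complement Z_11 - Z_12 Z_22^{-1} Z_12^H, which equals U_11^{-1}.
schur = z11 - z12 @ np.linalg.solve(z22, z12.conj().T)
u11 = np.linalg.inv(z)[:m, :m]

obj = np.real(np.linalg.det(schur))      # OBJ in (49)
obj_ub = np.real(np.linalg.det(z11))     # upper bound in (50)
obj_from_u11 = 1.0 / np.real(np.linalg.det(u11))
```

In this sketch `obj` coincides with `obj_from_u11` and does not exceed `obj_ub`; the gap between the two determinants is what the feedback taps beyond the 0th tap contribute.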

3) Structures of Optimal Source and Relay Precoding Matrices: 
Since we can always choose $\tilde{\mathbf{P}}_k$ and $\mathbf{A}_k$ such that $\tilde{\boldsymbol{\Psi}}_k$ is 
a diagonal matrix, cf. (45), the determinant in (51) is 
essentially the product of the diagonal entries of $\sum_{k=0}^{N_c-1} \tilde{\boldsymbol{\Psi}}_k^{-1}$. 
Consequently, $\mathrm{OBJ}_{ub}$ is equivalent to the objective function 
for the FD-LE receiver under the GMSE criterion. Thus, from 
Theorem 1 we obtain the following optimal structures for $\tilde{\mathbf{P}}_k$ 
and $\mathbf{A}_k$ 

$$\tilde{\mathbf{P}}_k = \bar{\mathbf{V}}_G^{(k)} \boldsymbol{\Lambda}_P^{(k)}, \quad \mathbf{A}_k = \bar{\mathbf{V}}_H^{(k)} \boldsymbol{\Lambda}_A^{(k)} \bar{\mathbf{U}}_G^{(k)\dagger}. \tag{52}$$

The optimal $\mathbf{P}_k$ is thus given by $\tilde{\mathbf{P}}_k \mathbf{V}_1^{\dagger}$. The remaining 
power allocation problem is identical to that for the GMSE 
criterion for FD-LE, cf. (38). It is worth mentioning that 
for $N_{fb} > 0$, the upper bound $\mathrm{OBJ}_{ub}$ constitutes a tight 
approximation of the objective function OBJ as is illustrated 
in Section V. 

D. Optimal Power Allocation 

From the previous two subsections, it can be concluded
that only two different types of power allocation problems
have to be solved, namely the problems for the AMSE and
GMSE criteria for FD-LE. The solutions to these problems
are also applicable to the maxMSE criterion for FD-LE and
to all three criteria for FD-DFE. However, since the objective
functions for the AMSE and GMSE criteria in (38) are not
jointly convex w.r.t. the power allocation variables, the globally
optimal solution is difficult to obtain. Thus, in the following,
we adopt a high signal-to-noise ratio (SNR) approximation for
$\Phi_{km}$, i.e., we assume that the product of the noise variances
is sufficiently small such that it can be ignored in the
denominator of (37), which leads to



km 



PsMnPr.km9lr.hl 



<-' v-'^r.km.l^kri 



1. 



(53) 



This approximation renders the optimization problem convex
such that efficient methods can be applied for its solution.
In particular, it can be shown that both objective functions
$f_{\mathrm{AMSE}}(\boldsymbol{\Phi})$ and $f_{\mathrm{GMSE}}(\boldsymbol{\Phi})$, $\boldsymbol{\Phi} = \{\Phi_{km}, \forall k, m\}$, are jointly
convex w.r.t. the power allocation variables $P_{s,km}$ and $P_{r,km}$.
The convexity of the objective functions can be
proved straightforwardly by showing that the Hessian matrix
w.r.t. $P_{s,km}$ and $P_{r,km}$ is positive semidefinite. We omit
the proof here because of space constraints. Furthermore, all
power constraints are affine in $P_{r,km}$ and $P_{s,km}$. Thus, the
optimization problem in (38), with $\Phi_{km}$ replaced by its
high-SNR approximation in (53), is a convex optimization problem.

We are now ready to derive an iterative power allocation
algorithm. To this end, we introduce the Lagrangian of the
considered power allocation problem

$$\mathcal{L} = f_X(\Phi_{km}) + \lambda\Big[\sum_{k,m} P_{s,km} - P_s\Big] + \mu\Big[\sum_{k,m} P_{r,km} - P_r\Big] - \sum_{k,m}\big(\beta_{km} P_{s,km} + \gamma_{km} P_{r,km}\big), \tag{54}$$

where $\lambda$ and $\mu$ are the Lagrange multipliers for the sum power
constraints for source and relay, respectively, and $\beta_{km}$ and
$\gamma_{km}$ are the Lagrange multipliers for the individual power
constraints for source and relay, respectively. Applying the
Karush-Kuhn-Tucker (KKT) conditions to (54), which are
sufficient and necessary conditions for convex optimization
problems [19], we obtain the optimal solution of the considered
problem as



P. 



V r.km km 



P 



s,km 2 (p2 U2 i ^2 ^ 

ykm\^rMm"'km ^ "u) 

« s.kmUkm 



9ln 



r.km 



U2 (p2 2 I 2 
"-kmy^ s.kmilkm, ^ "v 



A(ln2)S„a>2 

I him 

p{\n2)Braalal 



- 1 



(55) 
TABLE I

Algorithm for finding the optimal power allocation, ei and e2 

ARE SMALL CONSTANTS, E.G. ei = 62 = 10^'*. 



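1  Initialize $\mu^{[1]}$ and $\lambda^{[1]}$.
2  Initialize $P_{s,km}^{[1]}$, $P_{r,km}^{[1]}$, $\forall k, m$.
   Set $P_{s,km}^{\mathrm{opt}} = P_{s,km}^{[1]}$, $P_{r,km}^{\mathrm{opt}} = P_{r,km}^{[1]}$, $\forall k, m$.
3  Repeat
     Set iteration number to $n = 2$.
     Repeat
       for $m = 1 : M$, $k = 1 : N_c$
         Find $P_{s,km}^{[n]}$ from (55) using $P_{r,km}^{\mathrm{opt}}$ and $\lambda^{[n-1]}$.
       end for
       Update $\lambda^{[n]}$ using (56), $n = n + 1$.
     until $|\lambda^{[n+1]} - \lambda^{[n]}| < \epsilon_1$, set $P_{s,km}^{\mathrm{opt}} = P_{s,km}^{[n]}$.
     Set iteration number to $l = 2$.
     Repeat
       for $m = 1 : M$, $k = 1 : N_c$
         Find $P_{r,km}^{[l]}$ from (55) using $P_{s,km}^{\mathrm{opt}}$ and $\mu^{[l-1]}$.
       end for
       Update $\mu^{[l]}$ using (57), $l = l + 1$.
     until $|\mu^{[l+1]} - \mu^{[l]}| < \epsilon_2$, set $P_{r,km}^{\mathrm{opt}} = P_{r,km}^{[l]}$.
   until $P_{s,km}^{\mathrm{opt}}$ and $P_{r,km}^{\mathrm{opt}}$ converge.
4  $P_{s,km}^{\mathrm{opt}}$ and $P_{r,km}^{\mathrm{opt}}$, $\forall k, m$, are the optimal solution.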



where = 1 and = J2k=ii^km + 1) 
and the GMSE criteria, respectively, and [x 
The Lagrange multipliers $\lambda$ and $\mu$, which are chosen to satisfy
the sum power constraints for source and relay, respectively, can
be found with the following subgradient method [19]



1 for the AMSE 

+ = max(0, x). 



[n+l] 



- £l 




s.km 



-Ps 



r.km 



(56) 



(57) 



where $n$ is the iteration index, and $\epsilon_i$, $i = 1, 2$, are step sizes.
From (55), we observe that the optimal $P_{s,km}$ depends on
$P_{r,km}$ and vice versa. To tackle this problem, we propose
the algorithm in Table I to iteratively find the optimal power
allocations. Convergence of this algorithm to the optimal
solution is guaranteed because of the convexity of the
considered optimization problem. Note that if either $P_{s,km}$ or
$P_{r,km}$ is equal to 0, the other variable will also be 0. This
result is intuitively pleasing since, if for example the $(m, k)$th
subchannel is shut down in the S-R link, there is no need
to waste power on this subchannel in the R-D link. It is also
worth noting that for the GMSE criterion, $P_{s,km}$ and $P_{r,km}$ in (55)
depend on a term that couples all frequency tones, which means
that the optimal $P_{s,km}$ and $P_{r,km}$
for the $k$th frequency tone depend on the power allocations
in all other frequency tones. Therefore, finding the optimal
solution requires a higher complexity for the GMSE criterion
than for the AMSE criterion.
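
The multiplier update described in this subsection can be illustrated on a much simpler toy problem. The sketch below is not the paper's algorithm: it assumes a single-hop, AMSE-like objective sum_i 1/(1 + g_i p_i) with one sum power constraint, and the names `allocate`, `dual_subgradient`, the gains, the budget, and the step size are all hypothetical choices made for illustration. It only shows the two ingredients named in the text, a closed-form per-subchannel power for a fixed multiplier and a projected subgradient step on the multiplier.

```python
import math

def allocate(gains, lam):
    # Stationary point of sum_i 1/(1 + g_i p_i) + lam * sum_i p_i w.r.t. p_i,
    # clipped at zero: p_i = [1/sqrt(g_i * lam) - 1/g_i]^+.
    return [max(0.0, 1.0 / math.sqrt(g * lam) - 1.0 / g) for g in gains]

def dual_subgradient(gains, budget, lam=1.0, step=0.01, tol=1e-10, max_iter=100000):
    # Projected subgradient iteration on the multiplier of the sum power constraint.
    for _ in range(max_iter):
        powers = allocate(gains, lam)
        # Increase lam if the budget is exceeded, decrease it otherwise; keep lam > 0.
        new_lam = max(1e-12, lam + step * (sum(powers) - budget))
        converged = abs(new_lam - lam) < tol
        lam = new_lam
        if converged:
            break
    return allocate(gains, lam), lam

# Toy setting (assumed values): four subchannels, total power budget of 4.
powers, lam = dual_subgradient([2.0, 1.0, 0.5, 0.1], budget=4.0)
```

In this toy setting the weakest subchannel receives no power, which mirrors the observation above that subchannels may be shut down. The alternating structure of Table I would wrap two such loops, one per node, inside an outer loop.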

E. Suboptimal Power Allocation Schemes 

Since the proposed precoding matrix optimization scheme 
involves an iterative power allocation algorithm and consid- 
erable information exchange between source and relay, it 
is desirable to investigate suboptimal approaches with lower 
complexity and reduced feedback overhead. One option is to 
adopt equal power allocation at the source and to optimize only 
the power allocation at the relay. We refer to the corresponding 



scheme as EPA-S. EPA-S eliminates the need for information 
exchange between source and relay for power allocation, 
hence guaranteeing faster convergence of the power allocation 
algorithm. However, the EPA-S scheme still requires CSI 
feedback from the relay to the source for computation of the 
source precoding matrix. In order to completely avoid CSI 
feedback, one can perform precoding at the relay only, which 
we refer to as the ROP scheme. For FD-DFE, we also introduce
the UPS scheme, which applies only the unitary precoding
matrix $\mathbf{V}_1$ at the source. This is motivated by the result in
Section IV-B, where it is shown that this unitary matrix can 
balance the MSEs of the different spatial streams. Similar to 
ROP, the UPS scheme has the advantage of a reduced feedback 
overhead compared to optimal power allocation and the EPA- 
S scheme as the source only needs to acquire knowledge of 
the M X M unitary matrix Vq. 

V. Simulation Results 

In this section, we evaluate the performance of the
proposed source and relay precoding schemes using simulations.
We assume that each data block contains $N_c = 64$ symbols.
The channels are modelled as uncorrelated Rayleigh
block fading channels with power delay profile
$p[n] = \frac{1}{\sigma_t}\sum_{l=0}^{L_x-1} e^{-n/\sigma_t}\,\delta[n-l]$ [20], where $L_x \in \{L_g, L_h\}$ and
$\sigma_t = 2$, which corresponds to moderately frequency-selective
fading. For convenience, we set the values of $L_g$, $L_h$, $N_{g,s}$,
and $N_{g,r}$ all equal to 16. Unless stated otherwise, $N_{fb}$ is
set to 15. We assume identical noise variances for both
links, i.e., cr„ = cr^, and define the source and relay SNRs 

as (Eb/Noh ^ 7!^^^ {Eb/No)r ^ TfJ^, 
respectively, where $N_b$ is the number of bits per symbol.
For all simulation results shown, we set $(E_b/N_0)_s = 16$ dB
and examine bit error rate (BER) and capacity as functions
of $(E_b/N_0)_r$. All simulations are averaged over at least
10,000 independent channel realizations. In the following, the
proposed joint source and relay precoding design is referred to
as JSR, and the notation $\{M, N_s, N_r, N_d\}$ is used to specify
a system with the parameters appearing in the braces.
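
One way to generate channel taps for such a simulation is sketched below. It assumes an exponentially decaying power delay profile with decay constant 2 and 16 taps, as stated above, and additionally normalizes the profile to unit total power, which is an assumption of this sketch rather than something the text specifies. The function names `exponential_pdp` and `rayleigh_taps` are hypothetical.

```python
import math
import random

def exponential_pdp(num_taps, sigma_t):
    # Exponentially decaying tap powers, scaled so that they sum to one
    # (the unit-power normalization is an assumption of this sketch).
    raw = [math.exp(-n / sigma_t) for n in range(num_taps)]
    total = sum(raw)
    return [v / total for v in raw]

def rayleigh_taps(pdp, rng):
    # Independent zero-mean complex Gaussian taps with variance pdp[n],
    # split equally between real and imaginary parts.
    return [complex(rng.gauss(0.0, math.sqrt(p / 2.0)),
                    rng.gauss(0.0, math.sqrt(p / 2.0)))
            for p in pdp]

pdp = exponential_pdp(num_taps=16, sigma_t=2.0)
taps = rayleigh_taps(pdp, random.Random(0))
```

Each call to `rayleigh_taps` yields one block-fading realization; averaging over many independent realizations corresponds to the Monte Carlo averaging described in the text.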

A. Convergence of the Algorithm and Tightness of OBJub for 
FD-DFE 

In Figs. 2 and 3, we show the value of the objective function
for the GMSE criterion versus the numbers of inner and outer
iterations for different values of $(E_b/N_0)_r$ for a {2, 2, 2, 2}
MIMO relay system (similar results also hold for the AMSE
criterion). We define an outer iteration as one
optimization of $\{P_{s,km}\}$ or $\{P_{r,km}\}$ in the algorithm shown
in Table I, and each update of $\{P_{s,km}\}$ ($\{P_{r,km}\}$) within an outer
iteration constitutes one inner iteration. The reference lines
indicate the optimal values of the objective function. Figs. 2
and 3 show that it takes at most ten inner iterations to obtain
an intermediate solution for $\{P_{s,km}\}$ or $\{P_{r,km}\}$, and at most
three outer iterations to obtain the final solutions for $\{P_{s,km}\}$
and $\{P_{r,km}\}$. Also, from Fig. 3, we observe that the largest
improvement of the objective function value is obtained in
the first and second outer iterations when $(E_b/N_0)_r$ is small
and large, respectively. This observation suggests that for low
SNR, optimizing the source or the relay power is sufficient
to realize most of the achievable performance gain, while for
high SNR, a joint optimization of the source and relay powers
is beneficial.

Fig. 2. Objective function value (GMSE criterion) versus number of inner
iterations.

Fig. 3. Objective function value (GMSE criterion) versus number of outer
iterations.

Fig. 4. Objective function value of the FD-DFE receiver for different values
of $N_{fb}$.

In Fig. 4, we show the values of the objective function,
OBJ, for FD-DFE for different values of $N_{fb}$. Note
that OBJ for $N_{fb} = 0$ serves as the upper bound, $\mathrm{OBJ}_{ub}$, for
the general objective function. From the figure, we observe
that the upper bound is very close to the objective function
for all considered values of $N_{fb}$, especially for medium to
high SNR and for the {2, 3, 3, 3} system. Therefore, $\mathrm{OBJ}_{ub}$
constitutes a good approximation for the objective function for
the FD-DFE receiver.

B. Comparison of SC-FDE and OFDM for JSR Precoding 

In Fig. 5, we show the BER of uncoded quaternary phase-shift
keying (QPSK) as a function of $(E_b/N_0)_r$ for the
proposed FD-LE based MIMO relay system for the three
considered precoding matrix optimization criteria. For FD-DFE,
only the GMSE criterion is considered, as for FD-DFE all
three criteria are equivalent. For comparison, the performance
of a MIMO-OFDM relay system optimized under the same
criteria is also included [6]. The figure shows that for the
{2, 2, 2, 2} system, the proposed MIMO relay system with
an FD-LE receiver outperforms the corresponding OFDM-based
system by a large margin since, in contrast to uncoded
OFDM, FD-LE is able to exploit the frequency diversity
offered by the channel. In addition, for both FD-LE and
OFDM, the system employing the maxMSE criterion offers
the best performance since the worst-case MSE is minimized.
For FD-DFE, the performance improvement compared to FD-LE
and OFDM is remarkable and a much higher diversity gain
is observed. On the other hand, for the {2, 3, 3, 3} system, we
observe that the performance gaps between FD-DFE, FD-LE,
and OFDM become smaller. Surprisingly, using the maxMSE
criterion, the optimized OFDM and FD-LE systems achieve
a performance very close to that of FD-DFE. This is due
to the fact that the additional antennas offer additional spatial
diversity, which helps OFDM and FD-LE to effectively avoid
the deep spectrum nulls that otherwise negatively affect their
performance in frequency-selective fading.

In Fig. 6, we investigate the capacity of the OFDM and SC- 
FDE systems with different optimization criteria. As expected, 
the systems optimized under the GMSE criterion have the 
best performance since minimizing the GMSE is equivalent to 
maximizing the capacity. In general, the capacity achieved by 
the considered MIMO-OFDM relay systems is higher than that 
of the corresponding FD-LE relay systems, except for the case 
when both systems are optimized based on the maxMSE crite- 
rion. Indeed, the OFDM system optimized under the maxMSE 
criterion suffers from the worst capacity performance among 
all the considered schemes since the available power is mainly 
used to improve the MSE of the subcarriers with bad channel 
conditions instead of taking advantage of the subcarriers with 
good channel conditions. In addition, Fig. 6 shows that for FD-LE,
the AMSE and maxMSE criteria lead to exactly the same
capacity, which implies that the unitary rotation of the source
precoding matrix does not influence the capacity of the system.
Furthermore, the capacity achieved with FD-DFE is larger
than that achieved with any of the FD-LE schemes and very
close to that of OFDM. This is due to the lower stream MSEs
of FD-DFE compared to FD-LE, which translate into larger
stream SINRs and larger system capacity. For the {2, 3, 3, 3}
system, we observe that FD-LE and OFDM achieve almost the
same performance for the AMSE and GMSE criteria, implying
that with more source/relay/destination antennas, FD-LE will
approach the achievable capacity of the OFDM system.

Fig. 5. Uncoded BER of {2, 2, 2, 2} and {2, 3, 3, 3} MIMO relay systems
for JSR precoding using different optimization criteria.

Fig. 6. Capacity of {2, 2, 2, 2} and {2, 3, 3, 3} MIMO relay systems with
JSR relay precoding for different optimization criteria.
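
The link between stream MSEs, stream SINRs, and rate mentioned above can be made concrete with a small sketch. It assumes unit-power symbols and MMSE-type estimates, for which the standard relation SINR = 1/MSE - 1 holds; scaling factors such as a half-duplex penalty or normalization over the block length are deliberately omitted, and the function names are hypothetical.

```python
import math

def stream_sinrs(mses):
    # Standard MMSE relation for unit-power symbols: SINR = 1/MSE - 1.
    return [1.0 / e - 1.0 for e in mses]

def sum_rate_bits(mses):
    # Sum of log2(1 + SINR) over the streams, which equals -sum(log2(MSE)).
    return sum(math.log2(1.0 + s) for s in stream_sinrs(mses))

mses = [0.5, 0.25]
rate = sum_rate_bits(mses)
```

Because the sum rate equals the negative logarithm of the product of the stream MSEs under these assumptions, lowering the product of the MSEs raises the rate, which is consistent with the GMSE criterion being the capacity-oriented one.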

C. Performance of Suboptimal Power Allocation Schemes 

In Figs. 7 and 8, we plot the uncoded and coded BERs,
respectively, for the suboptimal power allocation schemes
discussed in Section IV-E for a {2, 2, 2, 2} system. For the coded case,
the standard rate-1/2 convolutional code with generator matrix
$(133, 171)_{\mathrm{oct}}$ is adopted. The OFDM and FD-LE systems are
both optimized under the maxMSE criterion. The FD-DFE
system is optimized under the GMSE criterion since for FD-DFE
all three considered criteria are equivalent to the GMSE
criterion. From Fig. 7 we observe that for uncoded transmission,
the FD-LE system outperforms the OFDM system if
both employ the same precoding technique. Fig. 7 also shows
that for FD-LE and OFDM, EPA-S and ROP suffer from
a considerable performance degradation compared to JSR,
while for FD-DFE, the performance loss is relatively small
for UPS and almost negligible for EPA-S. By coding across
subcarriers, OFDM-based relay systems can also exploit the
frequency diversity of the channel and significantly improve
their BER performance, cf. Fig. 8. Nevertheless, the coded
FD-LE system still outperforms the OFDM system if the
same precoding technique is assumed in both cases. Also,
Fig. 8 reveals that channel coding significantly reduces the
performance loss caused by suboptimal precoding techniques
for both OFDM and FD-LE.
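
For readers who want to reproduce the coded results, a rate-1/2 encoder with octal generators 133 and 171 can be sketched as follows. The bit-ordering convention (the current input bit corresponds to the most significant generator bit) is an assumption of this sketch, as are the names `parity` and `conv_encode`; termination, puncturing, interleaving, and decoding are not shown.

```python
G1, G2 = 0o133, 0o171  # generator polynomials in octal, constraint length 7

def parity(x):
    # Modulo-2 sum of the bits of x.
    return bin(x).count("1") & 1

def conv_encode(bits):
    # Feedforward rate-1/2 encoder; returns the two output streams interleaved.
    state = 0
    out = []
    for b in bits:
        # The newest bit enters at the most significant of the 7 register positions.
        state = ((b & 1) << 6) | (state >> 1)
        out.append(parity(state & G1))
        out.append(parity(state & G2))
    return out

# Impulse response: a single one followed by six zeros.
impulse = conv_encode([1, 0, 0, 0, 0, 0, 0])
```

With this convention, the two output streams of the impulse response reproduce the binary expansions of the two generators, which is a quick sanity check of the tap ordering.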

Since the performance of FD-DFE depends on the
number of FBF taps, in Fig. 9, we investigate the influence
of $N_{fb}$ on the performance of a {2, 2, 2, 2} system. The results
show that while the value of $N_{fb}$ has limited impact on the
performance of EPA-S, it does play a critical role for UPS.
The reason is that for EPA-S, the equivalent S-R-D channel
is diagonalized into $M$ parallel channels, thus eliminating
the inter-stream interference at the receiver. However, for the
case of UPS, the equivalent end-to-end channel is not fully
diagonalized and the received symbols experience inter-stream
interference. Consequently, an FBF with sufficiently large $N_{fb}$
is required to cancel out this interference. As can be inferred
from Fig. 9, there is a complexity tradeoff between the
transmitter and the receiver for FD-DFE. For EPA-S, since a small
number of FBF taps (e.g., $N_{fb} = 3$) is sufficient to achieve
good performance, the receiver complexity is similar to that of
FD-LE. However, comparatively complex FD signal processing
has to be carried out at the transmitter. This characteristic
makes EPA-S suitable for downlink transmission. For the
UPS scheme, on the other hand, the transmit processing is very
simple since the single-tap precoding matrix $\mathbf{V}_1$ can be directly
implemented in the TD. In addition, the feedback overhead
is low as $\mathbf{V}_1$ is identical for all frequency tones. However,
UPS requires a longer and thus more complex FBF to achieve
a high performance. These characteristics make UPS a very
promising scheme for uplink transmission.

Fig. 8. Coded BER of a {2, 2, 2, 2} MIMO relay system with JSR and
suboptimal precoding schemes.

Fig. 9. Uncoded and coded BER of a {2, 2, 2, 2} MIMO relay system with
FD-DFE using JSR and suboptimal precoding schemes with different numbers
of FBF taps.



VI. Conclusion 

In this paper, we have tackled the problem of transceiver 
design for MIMO relay systems employing SC-FDE. The 
optimal MMSE FD-LE and FD-DFE filters at the destination 
were derived, and the optimal structures of the source and 
relay precoding matrices were obtained in closed form for a 



general family of objective functions. For systems employing 
an FD-DFE receiver, we first showed that the considered 
objective functions are all equivalent and derived an upper 
bound on the original objective functions, which was shown 
to be equal to the GMSE objective function for the FD-LE
receiver. The remaining power allocation problem was
solved globally by using a high SNR approximation of the 
objective function and efficient convex optimization methods. 
Our results show that the proposed SC-FDE relaying schemes 
outperform the corresponding OFDM schemes in terms of both 
coded and uncoded BER. In addition, the performance gap 
between SC-FDE and OFDM relay systems decreases when 
the number of source/relay/destination antennas is larger than 
the number of data streams. Furthermore, we have shown that 
the proposed suboptimal power allocation schemes can reduce 
the system complexity and feedback overhead at the expense 
of a moderate performance degradation, especially in case of 
coded transmission, making them promising candidates for 
practical relay systems. 

Appendix 

We first provide some relevant definitions and lemmas that 
will be used in the proof. 

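Definition 1 [15, 1.A.1]: Given two $N \times 1$ real vectors
$\mathbf{x}, \mathbf{y} \in \mathbb{R}^N$, let $x_{[1]} \geq \cdots \geq x_{[N]}$ and $y_{[1]} \geq \cdots \geq y_{[N]}$ denote the
components of $\mathbf{x}$ and $\mathbf{y}$ in decreasing order. Then, $\mathbf{x}$ is
majorized by $\mathbf{y}$, or $\mathbf{x} \prec \mathbf{y}$, if $\sum_{i=1}^{k} x_{[i]} \leq \sum_{i=1}^{k} y_{[i]}$ for $k < N$ and
$\sum_{i=1}^{N} x_{[i]} = \sum_{i=1}^{N} y_{[i]}$. Vector $\mathbf{x}$ is weakly majorized by
$\mathbf{y}$, or $\mathbf{x} \prec_w \mathbf{y}$, if $\sum_{i=1}^{k} x_{[i]} \leq \sum_{i=1}^{k} y_{[i]}$, $\forall k$.

Definition 2 [15, 3.A.1]: A real function $f$ is Schur-convex
if for $\mathbf{x} \prec \mathbf{y}$, we have $f(\mathbf{x}) \leq f(\mathbf{y})$. Similarly, $f$ is Schur-concave
if for $\mathbf{x} \prec \mathbf{y}$, we have $f(\mathbf{x}) \geq f(\mathbf{y})$.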
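
The two definitions can be exercised with a few lines of code. The sketch below checks majorization by comparing partial sums of the sorted vectors; the function names `is_majorized` and `uniform_vector` and the numerical tolerance are hypothetical choices for illustration.

```python
def is_majorized(x, y, tol=1e-12):
    """Return True if x is majorized by y in the sense of Definition 1."""
    if len(x) != len(y):
        return False
    xs = sorted(x, reverse=True)
    ys = sorted(y, reverse=True)
    sx = sy = 0.0
    for k in range(len(xs)):
        sx += xs[k]
        sy += ys[k]
        # Partial sums of the k largest entries must not exceed those of y.
        if k < len(xs) - 1 and sx > sy + tol:
            return False
    # The total sums must coincide.
    return abs(sx - sy) <= tol

def uniform_vector(x):
    # Vector with all entries equal to the mean of x.
    return [sum(x) / len(x)] * len(x)

x = [3.0, 1.0, 2.0]
flat = uniform_vector(x)
```

Here `flat` is majorized by `x` but not the other way around. Since the maximum is a Schur-convex function, `max(flat) <= max(x)` follows, which is the mechanism used later for the maxMSE-type criteria.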

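Lemma 1 [15, 9.B.1]: For a Hermitian matrix $\mathbf{A}$ with
$\mathrm{diag}[\mathbf{A}]$ and $\boldsymbol{\lambda}(\mathbf{A})$ denoting vectors containing the main diagonal
elements and the eigenvalues of $\mathbf{A}$ arranged in decreasing
order, respectively, we have $\mathrm{diag}[\mathbf{A}] \prec \boldsymbol{\lambda}(\mathbf{A})$.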

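Lemma 2 [15, 9.H.2]: For $M$ matrices $\mathbf{A}_i \in \mathbb{C}^{N \times N}$, $i =
1, \cdots, M$, let $\mathbf{B} = \mathbf{A}_1 \mathbf{A}_2 \cdots \mathbf{A}_M$. Then, $\boldsymbol{\sigma}(\mathbf{B}) \prec_w \boldsymbol{\sigma}(\mathbf{A}_1) \odot
\boldsymbol{\sigma}(\mathbf{A}_2) \odot \cdots \odot \boldsymbol{\sigma}(\mathbf{A}_M)$, where $\boldsymbol{\sigma}(\mathbf{X})$ denotes the vector containing
the singular values of matrix $\mathbf{X}$ arranged in decreasing
order and $\odot$ denotes the element-wise product of two vectors.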

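Lemma 3 [15, 3.A.8]: A real function $f$ satisfies $\mathbf{x} \prec_w \mathbf{y} \Rightarrow
f(\mathbf{x}) \leq f(\mathbf{y})$ if and only if $f$ is Schur-convex and increasing.

Lemma 4 [15, 9.H.1]: For two Hermitian positive semidefinite
matrices $\{\mathbf{A}, \mathbf{B}\} \in \mathbb{C}^{N \times N}$ with eigenvalues $\lambda_{A,i}$, $\lambda_{B,i}$,
arranged in the same order, we have $\mathrm{tr}(\mathbf{A}\mathbf{B}) \geq
\sum_{i=1}^{N} \lambda_{A,i} \lambda_{B,N-i+1}$.

Lemma 5 [15, p. 7]: For a vector $\mathbf{x} \in \mathbb{R}^{N \times 1}$, we have
$\big(\sum_{i=1}^{N} x_i / N\big) \mathbf{1} \prec \mathbf{x}$, where $\mathbf{1}$ is the all-ones vector.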

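Lemma 6: For Hermitian matrices $\mathbf{A}_k$ and $\tilde{\mathbf{A}}_k$, $k =
1, 2, \cdots, N_c$, if $\mathrm{diag}[\mathbf{A}_k] \prec \mathrm{diag}[\tilde{\mathbf{A}}_k]$, we have
$\mathrm{diag}[\sum_{k=1}^{N_c} \mathbf{A}_k] \prec \mathrm{diag}[\sum_{k=1}^{N_c} \tilde{\mathbf{A}}_k]$.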

Proof: According to Definition 1, if diag[A/j] -< diag[Afe], 
we have ELi ^ Y.Ui^'M for 3 < ^ and 
Y.i=iO'[k{\ = Y.f=iO-[k{\ for 3 = N, where a^^i] and 
a[ki] are the zth diagonal entries of A^ and A^:, respec- 
tively. Since a[j,j] and a[j.j] , Vfc, i, are non-negative, we have 
Ef;iE-=ia[H] < Ef=iE-=i«[fe.] for J < and where Q« = VV^^U\,^^ and
Ek=i E»=i a[ki] = 1:^=1 E^=i for J = N. Therefore, Lemmas 1 and 2, we obtain 



(fe) 



f2('=)U^\ By applying 



we have I]fe=i diag[Afc]^ Z]fc=i diag[Afe], which is equiva- 
lent to diagEfeli Ak] ^ diag[Ef^i A,.]. 

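Lemma 7 [15, 9.B.2]: For a diagonal matrix $\mathbf{D} \in \mathbb{C}^{M \times M}$,
there is a unitary matrix $\mathbf{U}$ such that $\mathbf{A} = \mathbf{U}^{\dagger} \mathbf{D} \mathbf{U}$ has identical
diagonal entries equal to $\mathrm{tr}(\mathbf{D})/M$.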
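
Lemma 7 is an existence statement; one concrete choice that works is the normalized DFT matrix, since all of its entries have the same magnitude. The sketch below verifies this numerically for a real diagonal matrix. The use of the DFT matrix and the names `dft_matrix` and `rotated_diagonal` are illustrative assumptions, not necessarily the unitary matrix used in the paper.

```python
import cmath

def dft_matrix(m):
    # Unitary DFT matrix with entries exp(-2*pi*j*r*c/m) / sqrt(m).
    scale = 1.0 / m ** 0.5
    return [[scale * cmath.exp(-2j * cmath.pi * r * c / m) for c in range(m)]
            for r in range(m)]

def rotated_diagonal(d):
    # Diagonal of U^H D U for D = diag(d) and U the DFT matrix:
    # entry i equals sum_j conj(U[j][i]) * d[j] * U[j][i].
    m = len(d)
    U = dft_matrix(m)
    return [sum((U[j][i].conjugate() * d[j] * U[j][i]).real for j in range(m))
            for i in range(m)]

diag = rotated_diagonal([4.0, 2.0, 1.0, 1.0])
```

Every entry of `diag` equals the average of the original diagonal, that is, the trace divided by the dimension, which is exactly the balancing effect of the unitary rotation exploited in the proof.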

We now set out to prove the optimal structure of the source 
and relay precoding matrices when /(diag[E]) is a Schur- 
concave increasing function w.rt. diag[E]. Let us begin with 
the core term in the expression for E in ( fT2b . which is given 
by 



=1 



diag[Tfc] < A(Tfc) 



diag[(I 



2AW-2^-l(I^^+^2^(fc)-2,-ll 



(65) 



Therefore, diag[Tfc] is majorized by diag[Dj'^''D2'^''] when 
V^-' = Hi, Q^''^) = H2, Clf^ = H3, where H, e C^^x*^V^, 
are arbitrary diagonal matrices with unit norm diagonal ele- 
ments. Without loss of generality, we can choose ~ Im, Vi. 
Hence, we have 



X ~ 



f^(fe) ^U(fc)t^ V(f) =U(^)^ (66) 



(58) 

where we employed the matrix inversion lemma. The CV 
matrix can now be expressed as 



According to Lemma 6, 



diag[5]Tfe] diag[^(D('=)D('=))] 



k=l 



fe=l 



E 



2 

^ k=l 



■{NcIm - ^Tk). (59) holds. From the fact that -/(diag[cr2l^^ 



(67) 



k=l 



Since /(diag[(TglA/ — jt^k=i'^>^]) a Schur-concave 



decreasing function w.r.t. diag[^^;^ T^], — /(diag[cr^I 



M 



jf- '^k=i k]) is a Schur-convex increasing function 



w.r.t. diag[^^j^ T/j]. T/j can be further expressed as 
Tk - JLlYiiJkJi + allMj-'YkXk, 



(60) 



where we have used the definitions 



Xfe = (TsGfcPfe, Yk = HfeAfe, 
Jfc - Yfc(XfeX[. + (72l^Ji/^ (61) 

By rewriting in terms of X^, and J^., we obtain Y^. = 
Jfe(XfeXj, + cr2l^J-i/2 Plugging this result into ^ leads 
to 

Tk =X,(X,.Xi + a2l^rj-l/2jt (J^jt + ^2j^j-l 

Jfc(XfcX[. + a2ljvJ-i/'Xfe. (62) 
Now, we define the following SVDs 



^k - -^^X ^ X ^ •'fe - ^,7 ^^.7 ' .7 ' 

(X,xt + all^X'^' = vPiA'^P' + allM)-'^'n^'\ 



(63) 

where U^'' e C^-xa^, U^'^ G C^^'X^^ {Yf,^!^^^ e 
CAfxw.^ {A^\v^\a^*^^} e C*^xM Tjjg diagonal entries 

(fe) (fe) 

of Ax and Aj are both sorted in increasing order. We can 
use (|63]) to rewrite ( |62] ) as 

Tfc =v(,^)U«Q«t(AW2 ^^2i^^)-i/2Q(fe)t 



X (Im + aSA(;)-2)-iQf )(aW2 + allM)-'/' 

v(fc) A(fe-)^r(fe) 



xQ^A:^^^v^^^ 



(64) 



is a Schur-convex increasing function w.r.t. diag[^^j^ Tfe], 
( l67b . and Definition 2, we deduce that — /(diag[cr^lM — 
^Ell^^iT.]) < -/(diag[a2lM - ^E^iD^'^'Df]), 
which is equivalent to 



f{di^g[a%,~^Y.'^k]) 

^ k=l 
2 

>/(diag[a,^lM-^EDf^D('=']). 



(68) 



k=l 



Therefore, with the help of the matrices in (I66] l. the value 
of the objective function can be reduced to /(diag[crslAf — 



In the following, we derive the structure of the optimal
source and relay precoding matrices minimizing the power
consumption of the source and the relay. For simplicity of
notation, we only consider the case when $N_s = N_r = N_d$.
The proof can be easily extended to the case where $N_s$,
$N_r$, and $N_d$ have different values. From (63), we have
cr^u[?^A^*^V[j^^^Pfe = V^A^pvP, which can be used 
to express the source power consumption at frequency tone k 
as 



(/c)2^(fe)tu(fe)^(fc)-l, 

yC G G > 



> HA^t 



-2^W2) 



(69) 



where the inequality follows from Lemma 4, and the diagonal 
matrix Aq"* e ^AfxM contains the M largest singular values 
of Gfc. Therefore, in order to minimize the source power, we 



need to choose Q 



(fe) 



[I 



M 







Mx{N, 



Ug', where U^' contains the M right-most columns of Uq''. 
Recalling from ^ that V^^' = Im, the source matrix can
be expressed as 

P, = Iv^Kp-^A^P = V^^^A^P, (70) 

where Vg ' contains the M right-most columns of Vg '' 
and A^^^ ^ ^A^^'^^^A^^ Next, from (|63]l we obtain 
^W^Wv^t^'^ ^ Jfe(XfcX^+a2l„J-i/2. Using this re- 
suh, we can express the relay power consumption at frequency 
tone k as 

= tr(A(J-)-u(,^'tj^jtu«A«-) 

>tr(A(J)-2A«2)^ (71) 

where the inequality follows from Lemma 4, and the diagonal 
matrix A^-* e C^^*^ contains the M largest singular 
values of H/j. In ( TtTT i. equality holds for Q4'^'' = [Im 
OMx{Nr--M)]'^ 7 Uj*^-* = U^-*, where U^'' contains the 
M right-most columns of Recalling from (|66] | that 

U^)^ = OC^) = V^''^ and from ^ that U^'^ = U^*^^ 
we obtain for A^ the expression 

A. = YPAp-'Af{AP'+allM)-'/'lJ^^^' 

- V^^^A^f^U^j^^t^ (72) 

where V]^ and contain the M right-most columns of 
V^) and Vl^\ respectively, and A^*^ ^ A^^-^ A^j'\a^P^ + 
ct^Ia/ )~^^^- Hence, we have proved that the expressions for 
the source and relay precoding matrices given in ( |70] i and (|72| i 
minimize the objective function /(diag[E]), cf. ( l68T l, as well 
as the power consumption at the source and the relay, cf. (|69] l 
and (|7T). 

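Now, we turn our attention to the case when $f(\mathrm{diag}[\mathbf{E}])$ is a
Schur-convex increasing function w.r.t. $\mathrm{diag}[\mathbf{E}]$. From Lemma
5 we know that $\frac{1}{M}\mathrm{tr}(\mathbf{E})\mathbf{1} \prec \mathrm{diag}[\mathbf{E}]$. Combining this fact and
Definition 2, we obtain the inequality

$$f(\mathrm{diag}[\mathbf{E}]) \geq f\big(\tfrac{1}{M}\mathrm{tr}(\mathbf{E})\mathbf{1}\big), \tag{73}$$

where equality holds when the diagonal entries of $\mathbf{E}$ are
all equal to $\frac{1}{M}\mathrm{tr}(\mathbf{E})$. In the following, we show that by
applying a unitary rotation to the source precoding matrix,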
we can achieve this equality. Using the eigenvalue decompo- 
sition QlK-^Qk = U^^^^A^^^U^^^^ we can write ^^T^ = 
JJ^ + A^'')^^U^''^ Let us consider the feasible source 

~ (k) 

precoding matrix = PfcU^ Vq, where Vq is a unitary 
matrix and thus does not affect the power constraints. Replac- 
ing Pfc with Pfc in Vl/jT^, we arrive at 

=vI{Im + aP)-^V„, (74) 

which allows us to express the error CV matrix as 

2 iv 2 -'^^ 

E = ^E.:,*.-^ = ^(iM + Ai,^))-Vo. (75) 

k=l 



Since the sum over $k$ in (75) is a sum of diagonal matrices,
it is also a diagonal matrix. Based on Lemma 7, we
conclude that there exists a unitary matrix $\mathbf{V}_0$ such that $\mathbf{E}$
has identical diagonal elements given by $\frac{1}{M}\mathrm{tr}(\mathbf{E})$. Since the
objective function is an increasing function w.r.t. its arguments,
minimizing the original Schur-convex objective function is
now equivalent to minimizing $\frac{1}{M}\mathrm{tr}(\mathbf{E})$, which is a Schur-concave
function. Therefore, the optimal structures of $\mathbf{P}_k$ and
$\mathbf{A}_k$ are given by (70) and (72), respectively. Furthermore, as
the resulting U^^ can be shown to be an identity matrix, the 
source precoding matrix for Schur-convex functions is given 
by Pfc = PfcVo = v(^^a(.^Vo. 